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GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM 
AND METHODS OF USE THEREOF 



BACKGROUND OF THE TNVFNTTON 



Field of the Invention 



The present invention describes nucleic acid sequences for eukaryotic genes encoding 
€ lycopene e-cyclase (also known as e-cyclase and s lycopene cyclase), isopentenyl 
pyrophosphate isomerase (IPP) and p -carotene hydroxylase as well as vectors containing the 
same and hosts transformed with said vectors. The present invention also provides methods 
for augmenting the accumulation of carotenoids, changing the composition of the 
carotehoids, and producing novel and rare carotenoids. The present invention provides 
methods for controlling the ratio or relative amounts of various carotenoids in a host. The 
invention also relates to modified lycopene e-cyclase, IPP isomerase and p -carotene 
hydroxylase. Additionally, the present invention provides a method for screening for genes 
and cDNAs encoding enzymes of carotenoid biosynthesis and metabolism. 



carotene (or, in rare cases, the asymmetrical bicyclic a -carotene) is intimately associated with 
the photosynthetic reaction centers and plays a vital role in protecting against potentially 
lethal photooxidative damage (Koyama, 1991). P-carotene and other carotenoids derived 
from it or from a -carotene also serve as light-harvesting pigments (Siefermann-Harms, 
1987), are involved in the thermal dissipation of excess light energy captured by the light- 
harvesting antenna (Demmig- Adams & Adams, 1992), provide substrate for the biosynthesis 
of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991; Parry & Horgan, 1991), 
and are precursors of vitamin A in human and animal diets (Krinsky, 1987). Plants also 
exploit carotenoids as coloring agents in flowers and fruits to attract pollinators and agents of 
seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of agronomic 
value in a number of important crops. Carotenoids are currently harvested from a variety of 
organisms, including plants, algae, yeasts, cyanobacteria and bacteria, for use as pigments in 
food and feed. 



Background of the Invention 
Carotenoid pigments with cyclic endgroups are essential components of the 
photosynthetic apparatus in oxygenic photosynthetic organisms (e.g., cyanobacteria, algae 
and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment P- 
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The probable pathway for formation of cyclic carotenoids in plants, algae and 
cyanobacteria is illustrated in Figure 1. Two types of cyclic endgroups or rings are 
commonly found in higher plant carotenoids, these are referred to as the P {beta) and e 
(epsilon) rings (Fig. 3). The precursor acyclic endgroup (no ring structure) is referred to as 
the Y (psi) endgroup. The p and e endgroups differ only in the position of the double bond in 
the ring. Carotenoids with two p rings are ubiquitous, and those with one p and one € ring 
are common, but carotenoids with two e rings are uncommon, p -carotene (Fig. 1) has two P- 
endgroups and is a symmetrical compound that is the precursor of a number of other 
important plant carotenoids such as zeaxanthin and violaxanthin (Fig. 2). 

Genes encoding enzymes of carotenoid biosynthesis have previously been isolated 
from a variety of sources including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216, 
254-268; Misawa et al., 1990, J. Bacterid., 172, 6704-12), fungi (Schmidhauser et al., 1990, 
Mol. Cell. Biol. 10, 5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch, 45c, 
482-86; Cunningham et al., 1994) and higher plants (Bartley et al., Proc. Natl. Acad. Sci USA 
88, 6532-36; Martinez-Ferez & Vioque, 1992, Plant Mol. Biol. 18, 981-83). Many of the 
isolated enzymes show a great diversity in structure, function and inhibitory properties 
between sources. For example, phytoene desaturases from the cyanobacterium 
Synechococcus and from higher plants and green algae carry out a two-step desaturation to 
yield C-carotene as a reaction product. In plants and cyanobacteria a second enzyme (£- 
carotene desaturase), similar in amino acid sequence to the phytoene desaturase, catalyzes 
two additional desaturations to yield lycopene. In contrast, a single desaturase enzyme from 
Erwinia herbicola and from other bacteria introduces all four double bonds required to form 
lycopene. The Erwinia and other bacterial desaturases bear little amino acid sequence 
similarity to the plant and cyanobacterial desaturase enzymes, and are thought to be of 
unrelated ancestry. Therefore, even with a gene in hand from one source, it may be difficult to 
identify a gene encoding an enzyme of similar function in another organism. In particular, 
the sequence similarity between certain of the prokaryotic and eukaryotic genes encoding 
enzymes of carotenoid biosynthesis is quite low. 

Further, the mechanism of gene expression in prokaryotes and eukaryotes appears to 
differ sufficiently such that one cannot expect that an isolated eukaryotic gene will be 
properly expressed in a prokaryotic host. 
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The difficulties in isolating genes encoding enzymes with similar functions is 
exemplified by recent efforts to isolate the gene encoding the enzyme that catalyzes the 
formation of P -carotene from the acyclic precursor lycopene. Although a gene encoding an 
enzyme with this function had been isolated from a bacterium, it had not been isolated from 
5 any photosynthetic procaryote or from any eukaryotic organism. The isolation and 

characterization of the enzyme catalyzing formation of p -carotene in the cyanobacterium 
Synechococcus PCC7942 was described by the present inventors and others (Cunningham et 
al., 1993 and 1994). The amino acid sequence similarity of the cyanobacterial enzyme to the 
various bacterial lycopene p-cyclases is so low (ca. 18-25% overall; Cunningham et al., 

10 1 994) that there is much uncertainty as to whether they share a common ancestry or, instead, 
represent an example of convergent evolution. 

The need remains for the isolation of eukaryotic and prokaryotic genes and cDNAs 
encoding polypeptides involved in the carotenoid biosynthetic pathway, including those 
encoding a lycopene e-cyclase, IPP isomerase and P -carotene hydroxylase. There remains a 

15 need for methods to enhance the production of carotenoids, to alter the composition of; 

carotenoids, and to reduce or eliminate carotenoid production. There also remains a need in 
the art for methods for screening for genes and cDNAs encoding enzymes of carotenoid 
biosynthesis and metabolism. 

ST JMMARY OF THF mVPNTTON 
20 Accordingly, a first object of this invention is to provide purified and/or isolated 

nucleic acids which encode enzymes involved in carotenoid biosynthesis; in particular, 
lycopene e-cyclase , IPP isomerase and p -carotene hydroxylase. 

A second object of this invention is to provide purified and/or isolated nucleic acids 
which encode enzymes which produce novel or uncommon carotenoids. 
25 A third object of the present invention is to provide vectors containing said genes. 

A fourth object of the present invention is to provide hosts transformed with said 
vectors. * 

Another object of the present invention is to provide hosts which accumulate novel or 
uncommon carotenoids or which accumulate greater amounts of specific or total carotenoids. 
30 Another object of the present invention is to provide hosts with inhibited and/or 

altered carotenoid production. 
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Another object of this invention is to secure the expression of eukaryotic carotenoid- 
related genes in a recombinant prokaryotic host. 

Yet another object of the present invention is to provide a method for screening for 
eukaryotic and prokaryotic genes and cDNAs which encode enzymes involved in carotenoid 
5 biosynthesis and metabolism. 

An additional object of the invention is to provide a method for manipulating 
carotenoid biosynthesis in photosynthetic organisms by inhibiting the synthesis of certain 
enzymatic products to cause accumulation of precursor compounds. 

Another object of the invention is to provide modified lycopene e-cyclase, IPP 
10 isomerase and p -carotene hydroxylase. 

These and other objects of the present invention have been realized by the present 
inventors as described below. 

A subject of the present invention is an isolated and/or purified nucleic acid sequence 
which encodes for a protein having lycopene e-cyclase, IPP isomerase or p -carotene 
1 5 hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4, 14- 
21 or 23-27. 

The invention also includes vectors which comprise any of the nucleic acid sequences 
listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of producing or enhancing the 

20 production of a carotenoid in a host cell, comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
lycopene e-cyclase, DPP isomerase or p -carotene hydroxylase enzyme activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence to produce the protein. 

25 Yet another subject of the present invention is a method of modifying the production 

of carotenoids in a host cell, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for 
a protein which modifies lycopene e-cyclase, IPP isomerase or p -carotene hydroxylase 
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid 

30 sequence is operably linked to a promoter; and expressing the heterologous nucleic acid 

sequence in the host cell to modify the production of the carotenoids in the host cell, relative 
to the untransformed host cell. 
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The present invention also includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein having lycopene e-cyclase, 
IPP isomerase or (J -carotene hydroxylase enzyme activity, the method comprising inserting 
into the host cell a vector comprising the heterologous nucleic acid sequence, wherein the 
5 heterologous nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence. 

Also included is a method of expressing, in a host cell, a heterologous nucleic acid 
sequence which encodes for a protein which modifies lycopene e-cyclase, IPP isomerase or 
p -carotene hydroxylase enzyme activity in the host cell, relative to an untransformed host 
10 cell, the method comprising inserting into the host cell a vector comprising the heterologous 
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is a method for screening for genes and 
cDNAs which encode enzymes involved in carotenoid biosynthesis and metabolism. 

15 BEIEE PESOS TPTTON OF THF DRAWINGS 

A more complete appreciation of the invention and many of the attendant advantages 
thereof will be readily obtained as the same becomes better understood by reference to the 
following detailed description when considered in connection with the accompanying 
drawings, wherein: 

20 Figure 1 is a schematic representation of the putative pathway of P -carotene 

biosynthesis in cyanobacteria, algae and plants. The enzymes catalyzing various steps are 
indicated at the left. Target sites of the bleaching herbicides NFZ and MPTA are also 
indicated at the left. Abbreviations: DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl 
pyrophosphate; GGPP, geranylgeranyl pyrophosphate; GPP, geranyl pyrophosphate; IPP, 

25 isopentenyl pyrophosphate; LCY, lycopene cyclase; MVA, mevalonic acid; MPTA, 2-(4- 
methylphenoxy)triethylamine hydrochloride; NFZ, norflurazon; PDS, phytoene desaturase; 
PSY, phytoene synthase; ZDS, C-carotene desaturase; PPPP, prephytoene pyrophosphate. 

Figure 2 depicts possible routes of synthesis of cyclic carotenoids and common plant 
and algal xanthophylls (oxycarotenolds) from neurosporene. Demonstrated activities of the 

30 P- and e-cyclase enzymes of A. thaliana are indicated by bold arrows labelled with P or € 
respectively. A bar below the arrow leading to e-carotene indicates that the enzymatic 
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activity was examined but no product was detected. The steps marked by an arrow with a 
dotted line have not been specifically examined. Conventional numbering of the carbon 
atoms is given for neurosporene and a -carotene. Inverted triangles (T) mark positions of the 
double bonds introduced as a consequence of the desaturation reactions. 
5 Figure 3 depicts the carotene endgroups which are found in plants. 

Figure 4 is a DNA sequence and the predicted amino acid sequence of a lycopene €- 
cyclase cDNA isolated from A. thaliana (SEQ ID NOS: 1 and 2). These sequences were 
deposited under Genbank accession number U50738. This cDNA is incorporated into the 
plasmid pATeps. 

10 Figure 5 is a DNA sequence encoding the p -carotene hydroxylase isolated from A. 

thaliana (SEQ ID NO: 3). This cDNA is incorporated into the plasmid pATOHB. 

Figure 6 is an alignment of the predicted amino acid sequences of A. thaliana p- 

carotene hydroxylase (SEQ ID NO: 4) with those of the bacterial P-carotene hydroxylase 

enzymes from Alicalgenes sp. (SEQ ID NO: 5) (Genbank D58422), Erwinia herbicola EholO 
15 (SEQ ID NO.: 6) (GenBank M872280), Erwinia uredovora (SEQ ID NO.: 7) (GenBank 

D90087) and Agrobacterium aurianticum (SEQ ID NO.: 8) (GenBank D58420). A 

consensus sequence is also shown. All five genes are identical where a capital letter appears 

in the consensus. A lowercase letter indicates that three of five, including A. thaliana, have 

the identical residue. TM; transmembrane. 
20 Figure 7 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from A. 

thaliana (SEQ ID NO: 9). This cDNA is incorporated into the plasmid pATDP5. 

Figure 8 is a DNA sequence of a second cDNA encoding another IPP isomerase 

isolated from A. thaliana (SEQ ID NO: 10). This cDNA is incorporated into the plasmid 

pATDP7. 

25 Figure 9 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from 

Haematococcus pluvialis (SEQ ID NO: 1 1). This cDNA is incorporated into the plasmid 
pHP04. 

Figure 10 is a DNA sequence of a second cDNA encoding another IPP isomerase 
isolated from Haematococcus pluvialis (SEQ ID NO: 12). This cDNA is incorporated into 
30 the plasmid pHP05. 

Figure 1 1 is an alignment of the amino acid sequences predicted by IPP isomerase 
cDNAs isolated from A thaliana (SEQ ID NO.: 16 and 18), H. pluvialis (SEQ ID NOS.: 14 
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and 15), Clarkia breweri (SEQID NO.: 17) (See, Blanc & Pichersky, Plant Physiol. (1995) 
108:855; Genbank accession no. X82627) and Saccharomyces cerevisiae (SEQ ID NO.: 19) 



Figure 12 is a DNA sequence of the cDNA encoding an IPP isomerase isolated from 
Tagetes erecta (marigold; SEQ ED NO: 13). This cDNA is incorporated into the plasmid 
pPMDPl . xxx's denote a region not originally sequenced. Figure 21 A shows the complete 
marigold sequence. 

Figure 13 is an alignment of the consensus sequence of four plant p-cyclases (SEQ ID 
NO.: 20) with the A. thaliana lycopene e-cyclase (SEQ ID NO.: 21). A capital letter in the 
plant p consensus is used where all four P -cyclase genes predict the same amino acid residue 
in this position. A small letter indicates that an identical residue was found in three of the 
four. Dashes indicate that the amino acid residue was not conserved and dots in the sequence 
denote a gap. A consensus for the aligned sequences is given, in capital letters below the 
alignment, where the p- and e-cyclases have the same amino acid residue. Arrows indicate 
some of the conserved amino acids that will be used as junction sites for construction of * 
chimeric cyclases with novel enzymatic activities. Several regions of interest including a 
sequence signature indicative of a dinucleotide-binding motif and two predicted 
transmembrane (TM) helical regions are indicated below the alignment and are underlined. 

Figure 14 shows the nucleotide (SEQ ID NO:22) and amino acid sequences (SEQ ID 
NO:23) of the Adonis palaestina (pheasant's eye) e-cyclase cDNA #5. 

Figure 15A shows the nucleotide (SEQ ID NO:24) and amino acid sequences (SEQ 
ID NO:25) of a potato e-cyclase cDNA. Figure 15B shows the amino acid sequence (SEQ ID 
NO:26) of a chimeric lettuce/potato lycopene e-cyclase. Amino acids in lower case are from 
the lettuce cDNA and those in upper case are from the potato cDNA. The product of this 
chimeric cDNA has e-cyclase activity and converts lycopene to the monocyclic 8-carotene. 

Figure 16 shows a comparison between the amino acid sequences of the Arabidopsis 
e-cyclase (SEQ ID NO:27) and the potato e-cyclase (SEQ ID NO:25). 

Figure 17A shows the nucleotide sequence of the Adonis palaestina Ipil (SEQ ID 
NO:28) and Figure 17B shows the nucleotide sequence of the Adonis palaestina Ipi2 (SEQ 



(Genbank accession no. J05090). 



ID NO: 29). 
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Figure 1 8 A shows the nucleotide sequence of the Haematoccus pluvialis Ipil (SEQ 
ID NO:l 1) and Figure 1 8B shows the nucleotide sequence of the Haematoccus pluvialis Ipi2 
(SEQ ID NO:30). 

Figure 19A shows the nucleotide sequence of the Lactuca sativa (romaine lettuce) 
Ipil (SEQ ID NO:31) and Figure 19B shows the nucleotide sequence of the Lactuca sativa 
Ipi2 (SEQ ID NO: 32). 

Figure 20 shows the nucleotide sequence of the Chlamydomonas reinhardtii Ipil 
(SEQ ID NO:33). 

Figure 21 A shows the nucleotide sequence of the Tagetes erecta (marigold) Ipil (SEQ 
ID NO:34) and Figure 21B shows the nucleotide sequence of the Oryza sativa (rice) Ipil 
(SEQ ID NO:35). 

Figure 22 shows a amino acid sequence alignment of various plant and green algal 
isopentenyl isomerases (IPI) (SEQ ID NOS:16, 36-45). 

Figure 23 shows a comparison between Adonis palaestina e-cyclase cDNA #3 and 
cDNA #5 nucleotide sequences. 

Figure 24 shows a comparison between^ donis palaestina e-cyclase cDNA #3 and 
cDNA #5 predicted amino acid sequences. 

Figure 25 shows a sequence alignment of various plant (J- and e-cyclases. Those 
sequences outlined in grey denote identical sequences among the e-cyclases. Those 
sequences outlined in black denote identical sequences among both the P- and e-cyclases. 

Figure 26 shows a sequence alignment of the plant e-cyclases from Figure 25. Those 
sequences outlined in black denote identical sequences among the e-cyclases. 

Figure 27 is a dendrogram or ft4 tree" illustrating the degree of amino acid sequence 
similarity for various lycopene 0- and e-cyclases. 

Figure 28 shows a comparison between Arabidopsis e-cyclase and lettuce e-cyclase 
predicted amino acid sequences. 



The present invention includes an isolated and/or purified nucleic acid sequence 
which encodes for a protein having lycopene e-cyclase, IPP isomerase or p -carotene 
hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4, 14- 
21, 23 or 25-27. Nucleic acids encoding lycopene e-cyclase, p -carotene hydroxylase and IPP 
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isomerases have been isolated from several genetically distant sources. 

The present inventors have isolated nucleic acids encoding the enzyme IPP isomerase, 
which catalyzes the reversible conversion of isopentenyl pyrophosphate (IPP) to 
dimethylallyl pyrophosphate (DMAPP). IPP isomerase cDNAs were isolated from the plants 
5 A. thaliana, Tagetes erecta (marigold), Adonis palaestina (pheasant's eye), Lactuca sativa 
(romaine lettuce) and from the green algae H. pluvialis and Chlamydomonas reinhardtii. 
Alignments of the amino acid sequences predicted by some of these cDNAs are shown in 
Figures 12 and 22. Plasmids containing some of these cDNAs were deposited with the 
American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on March 4, 

10 1996 under ATCC accession numbers 98000 (pHP05 - K pluvialis); 98001 (pMDPl - 
marigold); 98002 (pATDP7 - A. thaliana) and 98004 (pHP04 - if. pluvialis). 

The present inventors have also isolated nucleic acids encoding the enzyme P- 
carotene hydroxylase, which is responsible for hydroxylating the P-endgroup in carotenoids. 
The nucleic acid of the present invention is shown in SEQ ID NO: 3 and Figure 5. The full 

15 length cDNA product hydroxylates both end groups of p -carotene as do products of cDNAs 
which encode proteins truncated by up to 50 amino acids from the N-terminus. Products of 
genes which encode proteins truncated between about 60-1 10 amino acids from the N- 
terminus preferentially hydroxylate only one ring. A plasmid containing this gene was 
deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 

20 20852 on March 4, 1996 under ATCC accession number 98003 (pATOHB - A. thaliana). 

The present inventors have also isolated nucleic acids encoding the enzyme lycopene 
e-cyclase, which is responsible for the formation of €-endgroups in carotenoids. The A 
thaliane e-cyclase adds an e ring to only one end of the symmetrical lycopene while the 
related p -cyclase adds a ring at both ends. The A thaliana cDNA of the present invention is 

25 shown in Figure 4 and SEQ ID NO: 1 . A plasmid containing this gene was deposited with 
the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on 
March 4, 1996 under ATCC accession number 98005 (pATeps - A. thaliana). 

In addition, lycopene e-cyclases have been identified in lettuce and in Adonis 
palaestina (cDNA #5) which encode enzymes that convert lycopene to the bicyclic e- 

30 carotene (e,e-carotene). An additional cDNA from Adonis palaestina (cDNA #3) encodes a 
lycopene e-cyclase which converts lycopene into 6 -carotene (e,i[r -carotene) and differs from 
the lycopene e-cyclase which forms bicyclic e-carotene (e,e-carotene) by only 5 amino acids. 

-9- 

BNSDOCID: <WO 9963055A1_I_> 



10 



15 



20 



25 



30 



WO 99/63055 W Q PCT/US99/12121 

One or more of these amino acids may be modified by alteration of the nucleotide sequence " 
in the #5 cDNA to obtain an enzyme which forms the bicyclic e,e-carotene. The sequences 
of the Adonis palaestina and Arabidopsis thaliana €-cyclases have about 70% nucleotide 
identity and about 72% amino acid identity. 

Initial experiments by the inventors with chimeric genes indicated that the part of the 
e-cyclase which is responsible for adding 2 e rings to form e,e-carotene is the carboxy 
terminal portion of the gene. The lettuce e-cyclase adds two e rings to form e,e-carotene. A 
DNA encoding a partial potato e-cyclase (missing its amino terminal portion), when 
combined with an amino terminal region from the lettuce e-cyclase gene, produces a 
monocyclic 8-carotene (e,tj,-carotene). With the discovery of the differences between the 
Adonis palaestina clone #3 and clone #5, the specific amino acids responsible for the addition 
of an extra e ring have been identified (Figure 24). Specifically, amino acid 55 is Thr in 
clone #3 and Ser in clone #5, amino acid 210 is Asn in clone #3 and Asp in clone #5, amino 
acid 231 is Asp in clone #3 and Glu in clone #5, amino acid 352 is He in clone #3 and Val in 
clone #5, and amino acid 524 is Lys in clone #3 and Arg in clone #5. It can be appreciated 
that these changes are quite conservative, as only one change, at amino acid 210, changes the 
charge of the protein. 

Thus, it is clear that the nucleic acids of the invention encoding the enzymes as 
presently disclosed may be altered to increase a particularly desirable property of the enzyme, 
to change a property of the enzyme, or to diminish an undesirable property of the enzyme. 
Such modifications can be by deletion, substitution, or insertion of one or more amino acids, 
and can be performed by routine enzymatic manipulation of the nucleic acid encoding the 
enzyme (such as by restriction enzyme digestion, removal of nucleotides by mung bean 
nuclease or 2ta/31, insertion of nucleotides by Klenow fragment, and by religation of the 
ends), by site-directed mutagenesis, or may be accidental, such as by low fidelity PCR or 
those obtained through mutations in hosts that are producers of the enzymes. These 
techniques as well as other suitable techniques are well known in the art. 

Mutations can be made in the nucleic acids of the invention such that a particular 
codon is changed to a codon which codes for a different amino acid. Such a mutation is 
generally made by making the fewest nucleotide changes possible. A substitution mutation 
of this sort can be made to change an amino acid in the resulting protein in a non- 
conservative manner (i.e., by changing the codon from an amino acid belonging to a grouping 
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of amino acids having a particular size or characteristic to an amino acid belonging to another 
grouping) or in a conservative manner (i.e., by changing the codon from an amino acid 
belonging to a grouping of amino acids having a particular size or characteristic to an amino 
acid belonging to the same grouping). Such a conservative change generally leads to less 
5 change in the structure and function of the resulting protein. A non-conservative change is 
more likely to alter the structure, activity or function of the resulting protein. The present 
invention should be considered to include sequences containing conservative changes which 
do not significantly alter the activity or binding characteristics of the resulting protein. 
The following is one example of various groupings of amino acids: 
10 Amino acids with nonpolar R groups: Alanine, Valine, Leucine, Isoleucine, Proline, 
Phenylalanine, Tryptophan and Methionine. 

Amino acids with uncharged polar R groups : Glycine, Serine, Threonine, Cysteine, Tyrosine, 
Asparagine and Glutamine. 

Amino acids with charged polar R groups (negatively charged at Ph 6.0): Aspartic acid and, 
15 Glutamic acid. 

Basic am ino acids (positively charged at pH 6.0): Lysine, Arginine and Histidine. 

Another grouping may be those amino acids with phenyl groups: Phenylalanine, 
Tryptophan and Tyrosine, 

Another grouping may be according to molecular weight (i.e., size of R groups). 
20 Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free NH 2 can be maintained. 

25 Amino acid substitutions may also be introduced to substitute an amino acid with a 

particularly preferable property. For example, a Cys may be introduced to provide a potential 
site for disulfide bridges with another Cys. A His may be introduced as a particularly 
"catalytic" site (i.e., His can act as an acid or base and is the most common amino acid in 
biochemical catalysis). Pro may be introduced because of its particularly planar structure, 

30 which induces p -turns in the protein's structure. 

It is clear that certain modifications of SEQ ID NOS: 2, 4, 14-21, 23 or 25-27 can take 
place without destroying the activity of the enzyme. It is noted especially that truncated 
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versions of the nucleic acids of the invention are functional. For example, several amino 
acids (from 1 to about 120) can be deleted from the N-terminus of the lycopene e-cyclases of 
the invention, and a functional protein can still be produced. This fact is made especially 
clear from Figure 25, which shows a sequence alignment of several plant €-cyclases. As can 
5 be seen from Figure 25, there is an enormous amount of sequence disparity between amino 
acid sequences 2 to about 50-70 (depending on the particular sequence, since gaps are 
present). There is less, but also a substantial amount of, sequence dissimilarity between about 
50-70 to about 90-120 (depending on the particular sequence). Thereafter, the sequences are 
fairly conserved, except for small pockets of dissimilarity between about 275-295 to about 

10 285-305 (depending on the particular sequence), and between about 395-415 to about 410- 
430 (depending on the particular sequence). 

The present inventors have found that the amount of the 5' region present in the 
nucleic acids of the invention can alter the activity of the enzyme. Instead of diminishing 
activity, truncating the 5' region of the nucleic acids of the invention may result in an enzyme 

15 with a different specificity. Thus, the present invention relates to nucleic acids and enzymes 
encoded thereby which are truncated to within 0-50, preferably 0-25, codons of the 5' 
initiation codon of their prokaiyotic counterparts as determined by alignment maps as 
discussed below. 

For example, when the cDNA encoding^, thaliana p -carotene hydroxylase was 
20 truncated, the resulting enzyme catalyzed the formation of P-cryptoxanthin as the major 

product and zeaxanthin as minor product; in contrast to its normal production of zeaxanthin. 

The present invention is intended to include those nucleic acid and amino acid 
sequences in which substitutions, deletions, additions or other modifications have taken 
place, as compared to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27, without destroying the activity 
25 of the enzyme. Preferably, the substitutions, deletions, additions or other modifications take 
place at the 5* end, or any other of those positions which already show dissimilarity between 
any of the presently disclosed amino acid sequences (see also Figure 25) or other amino acid 
sequences which are known in the art and which encode the same enzyme (i.e., lycopene e- 
cyclase, EPP isomerase or p -carotene hydroxylase). 
30 In each case, nucleic acid and amino acid sequence similarity and identity is measured 

using sequence analysis software, for example, the Sequence Analysis, Gap, or BestFit 
software packages of the Genetics Computer Group (University of Wisconsin Biotechnology 
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Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAligh (DNAStar, Inc., " 
1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105 
S. Bascom Avenue, Suite 200, Campbell, California 95008). Such software uses algorithms 
to match similar sequences by assigning degrees of identity to various substitutions, 
5 deletions, and other modifications, and includes detailed instructions as to useful parameters, 
etc., such that those of routine skill in the art can easily compare sequence similarities and 
identities. An example of a useful algorithm in this regard is the algorithm of Needleman and 
Wunsch, which is used in the Gap program discussed above. This program finds the 
alignment of two complete sequences that maximizes the number of matches and minimizes 

10 the number of gaps. Another useful algorithm is the algorithm of Smith and Waterman, 
which is used in the BestFit program discussed above. This program creates an optimal 
alignment of the best segment of similarity between two sequences. Optimal alignments are 
found by inserting gaps to maximize the number of matches using the local homology 
algorithm of Smith and Waterman. 

15 Conservative (i.e. similar) substitutions typically include substitutions within the 

following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, glutamic 
acid, asparagine and glutamine; serine and threonine; lysine and arginine; and phenylalanine 
and tyrosine. Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle, J. Mol Biol 157: 105-132 (1982)), or on the basis of 

20 the ability to assume similar polypeptide secondary structure (see Chou and Fasman, Adv. 
Enzymol 47: 45-148 (1978)). 

If comparison is made between nucleotide sequences, preferably the length of 
comparison sequences is at least 50 nucleotides, more preferably at least 60 nucleotides, at 
least 75 nucleotides or at least 100 nucleotides. It is most preferred if comparison is made 

25 between the nucleic acid sequences encoding the enzyme coding regions necessary for 

enzyme activity. If comparison is made between amino acid sequences, preferably the length 
of comparison is at least 20 amino acids, more preferably at least 30 amino acids, at least 40 
amino acids or at least 50 amino acids. It is most preferred if comparison is made between 
the amino acid sequences in the enzyme coding regions necessary for enzyme activity. 

30 It should be appreciated that also within the scope of the present invention are nucleic 

acid sequences encoding lycopene € -cyclases, IPP isomerases and p -carotene hydroxylases 
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which code for enzymes having the same amino acid sequence as SEQ ID NOS: 2, 4, 14-21, 
23 or 25-27, but which are degenerate to the nucleic acids specifically disclosed herein. 

The amino acid residues described herein are preferred to be in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino acid residue, 



Volumes I-IH [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); 
"Nucleic Acid Hybridization" [B.D. Hames & S J. Higgins eds. (1985)]; "Transcription And 

15 Translation" [B.D. Hames & S.J. Higgins, eds. (1984)]; "Animal Cell Culture" [R.I. 

Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A 
Practical Guide To Molecular Cloning" (1984). 

The present invention also includes vectors. Suitable vectors according to the present 
invention comprise a nucleic acid of the invention encoding an enzyme involved in 

20 carotenoid biosynthesis or metabolism and a suitable promoter for the host, and can be 

constructed using techniques well known in the art (for example Sambrook et al., Molecular 
Cloning A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 
1989; Ausubel et al., Current Protocols in Molecular Biology , Greene Publishing and Wiley , 
Interscience, New York, 1991). Suitable vectors for eukaryotic expression in plants are 

25 described in Frey et al., Plant J. (1995) 8(5):693 and Misawa et al, 1994a; incorporated herein 
by reference. Suitable vectors for prokaryotic expression include pACYCl 84, pUCl 1 9, and 
pBR322 (available from New England BioLabs, Bevery, MA) and pTrcHis (Invitrogen) and 
pET28 (Novagen) and derivatives thereof. The vectors of the present invention can 
additionally contain regulatory elements such as promoters, repressors, selectable markers 

30 such as antibiotic resistance genes, etc. 

The nucleic acids encoding the carotenoid enzymes as described above, when cloned 
into a suitable expression vector, can be used to overexpress these enzymes in a plant 



5 



as long as the desired functional property of immunoglobulin-binding is retained by the 
polypeptide. 



10 



In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the 
art. Such techniques are explained fully in the literature. See, e.g., Sambrook et al, 
"Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular 
Biology" Volumes Mil [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory 
Handbook" Volumes I-HI [J. E. Celis, ed. (1994))]; "Current Protocols in Immunology" 
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expression system or to inhibit the expression of these enzymes. For example, a vector 
containing the gene encoding lycopene e-cyclase can be used to increase the amount of a- 
carotene and carotenoids derived from a-carotene (such as lutein and ot-cryptoxanthin) in an 
organism and thereby alter the nutritional value, pharmacology and visual appearance value 
5 of the organism. 

Therefore, the present invention includes a method of producing or enhancing the 
production of a carotenoid in a host cell, relative to an untransformed host cell, the method 
comprising inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having lycopene e-cyclase, IPP isomerase or P- 

10 carotene hydroxylase enzyme activity, wherein the heterologous nucleic acid sequence is 
operably linked to a promoter; and expressing the heterologous nucleic acid sequence to 
produce the protein. 

The present invention also includes a method of modifying the production of 
carotenoids in a host cell, the method comprising inserting into the host cell a vector 

15 comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for 
a protein which modifies lycopene e-cyclase, IPP isomerase or p -carotene hydroxylase 
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid 
sequence in the host cell to modify the production of the carotenoids in the host cell, relative 

20 to the untransformed host cell. 

The term "modifying the production" means that the amount of carotenoids produced 
in the host cell can be enhanced, reduced, or left the same, as compared to the untransformed 
host cell. In accordance with one embodiment of the present invention, the make-up of the 
carotenoids (i.e., the specific carotenoids produced) is changed vis a vis each other, and this 

25 change in make-up may result in either a net gain, net loss, or no net change in the total 

amount of carotenoids produced in the cell. In accordance with another embodiment of the 
present invention, the production or the biochemical activity of the carotenoids (or the 
enzymes which catalyze their formation) is enhanced by the insertion of an enzyme-encoding 
nucleic acid of the invention^ In yet another embodiment of the invention, the production or 

30 the biochemical activity of the carotenoids (or the enzymes which catalyze their formation) 
may be reduced or inhibited by a number of different approaches available to those skilled in 
the art, including but not limited to such methodologies or approaches as anti-sense (e.g., 
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Gray et al (1992) Plant Mol. Biol. 19:69-87), ribozymes (e.g., Wegener et al (1994) Mol. 
Gen. Genet. 245:465-470), co-suppression (e.g., Fray and Grierson (1993) Plant Mol. Biol. 
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. (1997) Plant J. 11:1195- 
1206), intracellular antibodies (e.g., Rondon and Marasco (1997) Ann. Rev. Microbiol. 
5 51 :257-283) or whatever other approaches rely on the knowledge or availability of the nucleic 
acid or amino acid sequences of the invention and/or portions thereof, to thereby reduce 
accumulation of carotenoids with e rings and compounds derived from them (for e-cyclase 
inhibition), or carotenoids with hydroxylated p rings and compounds derived from them (for 
p -hydroxylase inhibition), or, in the case if IPP isomerase, accumulation of any isoprenoid 
10 compound. 

Preferably, at least a portion of the nucleic acid sequences used in the methods, 
vectors and host cells of the invention codes for an enzyme having an amino acid sequence 
which is at least 85% identical, preferably at least 90%, at least 95% or completely identical 
to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27. Sequence identity is determined as noted above. 

1 5 Preferably, sequence additions, deletions or other modifications are made as indicated above, 
so as to not affect the function of the particular enzyme. 

In a preferred embodiment, vectors are manufactured which contain a DNA encoding 
a eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic carotenoid 
enzyme. The inventors have discovered that inclusion of an IPP isomerase gene increases the 

20 supply of substrate for the carotenoid pathway; thereby enhancing the production of 

carotenoid endproducts, as compared to a host cell which is not transformed with such a 
vector. This is apparent from the much deeper pigmentation in carotenoid-accumulating 
colonies of E. coli which also contain one of the aforementioned IPP isomerase genes when 
compared to colonies that lack this additional IPP isomerase gene. Similarly, a vector 

25 comprising an IPP isomerase gene can be used to enhance production of any secondary 
metabolite of dimethylallyl pyrophosphate and/or isopentenyl pyrophosphate (such as 
isoprenoids, steroids, carotenoids, etc.). The term "isoprenoid" is intended to mean any 
member of the class of naturally occurring compounds whose carbon skeletons are composed, 
in part or entirely, of isopentyl C 5 units. Preferably, the carbon skeleton is of an essential oil, 

30 a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel. 

A vector containing the cDNA encoding a lycopene €-cyclase of the invention, 
preferably the lettuce lycopene e-cyclase or Adonis €-cyclase #5, can be used to increase the 
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amount of bicyclic e-carotene in an organism and thereby alter the nutritional value, 
pharmacology and visual appearance value of the organism. In addition, the transformed 
organism can be used in the formulation of therapeutic agents, for example in the treatment of 
cancer (see Mayne et al (1996) FASEB J. 10:690-701; Tsushima et al (1995) Biol. Pharm. 
5 Bull. 18:227-233). 

An antisense strand of a nucleic acid of the invention can be inserted into a vector. 
For example, the lycopene e-cyclase gene can be inserted into a vector and incorporated into 
the genomic DNA of a host, thereby inhibiting the synthesis of e,P -carotenoids (lutein and a- 
carotene) and enhancing the synthesis of P,P -carotenoids (zeaxanthin and p -carotene). 

10 The present invention also relates to novel enzymes which are encoded by the amino 

acid sequences of the invention, or portions thereof. 

The present invention also relates to novel enzymes which can transform known 
carotenoids into novel or uncommon products. Currently €-carotene (see Figure 2) and y- 
carotene are commonly produced only in minor amounts. As described below, an enzyme 

15 can be produced which transforms lycopene to y -carotene and lycopene to e-carotene. 1 With 
these products in hand, bulk synthesis of other carotenoids derived from them are possible. 
For example, e-carotene can be hydroxylated to form lactucaxanthin, ah isomer of lutein (one 
e and one p ring) and zeaxanthin (two p rings) where both endgroups are, instead, e rings. 

In addition to novel enzymes produced by truncating the 5* region of known enzymes, 

20 as discussed above, novel enzymes which can participate in the formation of unusual 

carotenoids can be formed by replacing portions of one gene with an analogous sequence 
from a structurally related gene. For example, p -cyclase and e-cyclase are structurally 
related (see Figure 13). By replacing a portion of p -lycopene cyclase with the analogous 
portion of e-cyclase, an enzyme which produces y -carotene will be produced (one P 

25 endgroup). Further, by replacing a portion of the lycopene e-cyclase with the analogous 
portion of p -cyclase, an enzyme which produces e-carotene will be produced (with some 
exceptions, such as the lettuce e-cyclase, plant e-cyclases normally produce a compound with 
one e-endgroup, 6-carotene). Similarly, P-hydroxylase could be modified to produce 
enzymes of novel function by creation of hybrids with e-hydroxylase. 

30 Host systems according to the present invention can comprise any organism that 

already produces carotenoids or which has been genetically modified to produce carotenoids. 
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The IPP isomerase genes are more broadly applicable for enhancing production of any 
product dependent on DMAPP and/or IPP as a precursor. 

Organisms which already produce carotenoids include plants, algae, some yeasts, 
fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
5 with vectors according to the present invention can be done using standard techniques such as 
those described in Misawa et al., (1990) supra; Hundle et al., (1993) supra; Hundle et al., 
(1991) supra; Misawa et al., (1991) supra; Sandmann et al., supra; and Schnurr et al., supra. 

Transgenic organisms can be constructed which include the nucleic acid sequences of 
the present invention (Bird et al, 1991; Bramley et al, 1992; Misawa et al, 1994a; Misawa et 
10 al, 1994b; Cunningham et al, 1993). The incorporation of these sequences can allow the 
controlling of carotenoid biosynthesis, content, or composition in the host cell. These 
transgenic systems can be constructed to incorporate sequences which allow for the 
overexpression of the nucleic acids of the present invention. Transgenic systems can also be 
constructed containing antisense expression of the nucleic acid sequences of the present 
15 invention. Such antisense expression would result in the accumulation of the substrates of the 
substrates of the enzyme encoded by the sense strand. 

A method for screening for eukaryotic genes which encode enzymes involved in 
carotenoid biosynthesis comprises transforming a prokaryotic host with a nucleic acid which 
may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; culturing said 
20 transformed host to obtain colonies; and screening for colonies exhibiting a different color 
than colonies of the untransformed host. 

Suitable hosts include E. coli, cyanobacteria such as Synechococcus and 
Synechocystis, alga and plant cells. E. coli are preferred. 

In a preferred embodiment, the above "color complementation" screening protocol can 
25 be enhanced by using mutants which are either (1) deficient in at least one carotenoid 

biosynthetic gene or (2) overexpress at least one carotenoid biosynthetic gene. In either case, 
such mutants will accumulate carotenoid precursors. 

Prokaryotic and eukaryotic DNA or cDNA libraries can be screened in total for the 
presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred 
30 organisms to be screened include photosynthetic organisms. 
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E. coli can be transformed with these eukaryotic cDNA libraries using conventional 
methods such as those described in Sambrook et al, 1989 and according to protocols 
described by the vendors of the cloning vectors. 

For example, the cDNA libraries in bacteriophage vectors such as lambdaZAP 
5 (Stratagene) or lambda ZIPLOX (Gibco BRL) can be excised en masse and used to transform 
E.coli. 

Transformed E. coli can be cultured using conventional techniques. The culture broth 
preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics include 
penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 15-40°C, 

10 preferably at room temperature or slightly above (18-28°C), for 12 hours to 7 days. 

Cultures are plated and the plates are screened visually for colonies with a different 
color than the colonies of the host E. coli transformed with the empty plasmid cloning vector. 
For example, E. coli transformed with the plasmid, pAC-BETA (described below), produce 
yellow colonies that accumulate p -carotene. After transformation with a cDNA library, 

15 colonies which contain a different hue than those formed by E. co/i/pAC-BETA would be 
expected to contain enzymes which modify the structure or accumulation of p -carotene. 
Similar E. coli strains can be engineered which accumulate earlier products in carotenoid 
biosynthesis, such as lycopene, y -carotene, etc. 

Having generally described this invention, a further understanding can be obtained by 

20 reference to certain specific examples which are provided herein for purposes of illustration 
only and are not intended to be limiting unless otherwise specified. 

EXAMPLE 

I. Isolation of p -carotene hydroxylase 
Plasmid Construction 

25 An 8.6kb Bglll fragment containing the carotenoid biosynthetic genes of Erwinia 

herbicola was first cloned in the BamHI site of plasmid vector pACYCl 84 (chloramphenicol 
resistant), and then a 1.1 kb BamHI fragment containing the E. herbicola p -carotene 
hydroxylase (CrtZ) was deleted. E.coli strains containing the resulting plasmid, p AC-BETA, 
accumulate 6-carotene and form yellow colonies (Cunningham et aL, 1994). 

30 A full length cDNA encoding IPP isomerase of Haematococcus pluvialis (HP04) was 

first excised with BamHI and Kpnl from pBluescript SK-, and then ligated into the 
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corresponding sites of the pTrcHisA vector with high-level expression from the trc promoter 
(Invitrogen, Inc.). A fragment containing the IPP isomerase and trc promoter was 
subsequently excised with EcoKV and Kpnl y treated with the Klenow fragment of DNA 
polymerase to produce blunt ends, and ligated in the Klenow-treated Hindlll site of pAC- 
BETA. E.coli cells transformed with this new plasmid pAC-BETA-04 form orange colonies 
on LB plates (vs. yellow for those containing pAC-BETA) and cultures accumulate 
substantially more p -carotene (ca. two fold) than those that contain p AC-BETA. 

Screening of an Arahidopsis cDNA Library 

Several X cDNA expression libraries of Arahidopsis were obtained from the 
Arahidopsis Biological Resource Center (Ohio State University, Columbus, OH) (Kieber et 
al., 1993). The X cDNA libraries were excised in vivo using Stratagene's ExAssist SOLR 
system to produce a phagemid cDNA library wherein each phagemid contained also a gene 
conferring resistance to the antibiotic ampicillin. 

E.coli strain DH10BZIP was chosen as the host cell for the screening and pigment 
production, although we have also used TOP10F* and XLl-Blue for this purpose. DH10B 
cells were transformed with plasmid pAC-BETA-04 and were plated on LB agar plates 
containing chloramphenicol at 50 ng/ml (from United States Biochemical Corporation). The 
phagemid Arahidopsis cDNA library was then introduced into DH10B cells already 
containing pAC-BETA-04. Transformed cells containing both pAC-BETA-04 and 
Arahidopsis cDNA library phagemids were selected on chloramphenicol plus ampicillin (150 
Hg/ml) agar plates. Maximum color development occurred after 3 to 7 days incubation at 
room temperature, and the rare bright yellow colonies were selected from a background of 
many thousands of orange colonies on each agar plate. Selected colonies were inoculated 
into 3 ml liquid LB medium containing ampicillin and chloramphenicol, and cultures were 
incubated at room temperature for 1-2 days, with shaking. Cells were then harvested by 
centrifugation and extracted with acetone in microfuge tubes. After centrifiigation, the 
pigmented extract was spotted onto silica gel thin-layer chromatography (TLC) plates, and 
developed with a hexanerether (1 :1, by volume) mobile phases. B-carotene hydroxylase- 
encoding cDNAs were identified based on the appearance of a yellow pigment that co- 
migrated with zeaxanthin on the TLC plates. 
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Subcloning and Sequencing 

The plasmid containing the fi-carotene hydroxylase cDNA was recovered and 
analyzed by standard procedures (Sambrook et al., 1989). The Arabidopsis B-carotene 
hydroxylase was sequenced completely on both strands on an automatic sequencer (Applied 
5 Biosystems, Model 373 A, Version 2.0. 1 S). The cDNA insert of 0.95kb also was excised 
and ligated into the a pTrcHis vector. A BglH restriction site within the cDNA was used to 
remove that portion of the cDNA that encodes the predicted polypeptide N terminal sequence 
region that is not also found in bacterial [5-carotene hydroxylases (Figure 6). A Bglll-Xhol 
fragment was directionally cloned in BamHI-XhoI digested TrcHis vectors. 

1 0 Pigment An alysis 

A single colony was used to inoculate 50 ml of LB containing ampicillin and 
chloramphenicol in a 250-ml flask. Cultures were incubated at 28°C for 36 hours with gentle 
shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed once with 
distilled H 2 0 and resuspended with 0.5 ml of water. The extraction procedures and HPLC 
15 were essentially as described previously (Cunningham et al, 1994). 

II. Isolation and biochemical analysis of an Arabidopsis lycopene e-cyclase 
Plasmid Construction 

Construction of plasmids pAC-LYC, pAC-NEUR, and pAC-ZETA is described in 
Cunningham et al., (1994). In brief, the appropriate carotenoid biosynthetic genes from 

20 Erwinia herbicola, Rhodobacter capsulatus, and Synechococcus sp. strain PCC7942 were 

cloned in the plasmid vector pACYCl 84 (New England BioLabs, Beverly, MA). Cultures of 
E. coli containing the plasmids pAC-ZETA, pAC-NEUR, and pAC-LYC, accumulate £- 
carotene, neurosporene, and lycopene, respectively. The plasmid pAC-ZETA was 
constructed as follows: an 8.6-kb Bglll fragment containing the carotenoid biosynthetic genes 

25 of E. herbicola (GenBank M87280; Hundle et al., 1991) was obtained after partial digestion 
' y of plasmid pPL376 (Perry et al., 1986; Tuveson et al., 1986) and cloned in the BamHI site of 
pACYCl 84 to give the plasmid pAC-EHER. Deletion of adjacent 0.8- and 1 .1-kb BamHI- 
BamHI fragments (deletion Z in Cunningham et al., 1994), and of a 1.1 kB Sall-Sall fragment 
(deletion X) served to remove most of the coding regions for the E. herbicola p -carotene 

30 hydroxylase (crtZ gene) and zeaxanthin glucosyltransferase (crtX gene), respectively. The 
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resulting plasmid, pAC-BETA, retains functional genes for geranylgeranyl pyrophosphate 
synthase (crtE), phytoene synthase (crtB), phytoene desaturase (crti), and lycopene cyclase 
(crtY). Cells of E. coli containing this plasmid form yellow colonies and accumulate 0- 
carotene. A plasmid containing both the lycopene e- and P -cyclase cDNAs of A. thaliana 
5 was constructed by excising the e-cyclase in clone y2 as a PvuI-PvuII fragment and ligating 
this piece in the SnaBI site of a plasmid (pSPORT 1 from GIBCO-BRL) that already 
contained the p -cyclase (Cunningham et al., 1996). 

Organisms and Growth Conditions 

E. coli strains TOP 10 and TOP 10 F' (obtained from Invitrogen Corporation, San 
10 Diego, CA) and XL 1 -Blue (Stratagene) were grown in Luria-Bertani (LB) medium 

(Sambrook et al., 1989) at 37°C in darkness on a platform shaker at 225 cycles per min. 

Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl). 

Ampicillin at 150 |ig/mL and/or chloramphenicol at 50 [ig/mL (both from United States 

Biochemical Corporation) were used, as appropriate, for selection and maintenance of 
15 plasmids. 

Mass Exc ision and Color Complementation Screening of an A. thaliana cDNA TJhrary 
A size-fractionated 1-2 kB cDNA library of A. thaliana in lambda ZAPII (Kieber et 
al., 1993) was obtained from the Arabidopsis Biological Resource Center at The Ohio State 
University (stock number CD4-14). Other size fractionated libraries were also obtained 

20 (stock numbers CD4-13, CD4-15, and CD4-16). An aliquot of each library was treated to 

cause a mass excision of the cDNAs and thereby produce a phagemid library according to the 
instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1- 
Blue and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of E. coli 

25 TOP 10 F' (this strain contained the plasmid pAC-LYC) by incubation of the phagemid with 
the E. coli cells for 15 min at 37°C. Cells had been grown overnight at 30°C in LB medium 
supplemented with 2% (w/v) maltose and 10 mM MgS0 4 (final concentration), and harvested 
in 1.5 ml microfuge tubes at a setting of 3 on an Eppendorf microfuge (541 5C) for 10 min. 
The pellets were resuspended in 10 mM MgS0 4 to a volume equal to one-half that of the 
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initial culture volume. Transformants were spread on large (150 mm diameter) LB agar petri 
plates containing antibiotics to provide for selection of cDNA clones (ampicillin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming units 
were spread on each plate. Petri plates were incubated at 37-C for 16 hr and then at room 
5 temperature for 2 to 7 days to allow maximum color development. Plates were screened 
visually with the aid of an illuminated 3x magnifier and a low power stage-dissecting 
microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be observed 
in the background of pink colonies. A colony color of yellow or pinkish-yellow was taken as 
presumptive evidence of a cyclization activity. These yellow colonies were collected with 

10 sterile toothpicks and used to inoculate 3ml of LB medium in culture tubes with overnight 
growth at 37°C and shaking at 225 cycles/min. Cultures were split into two aliquots in 
microfuge tubes and harvested by centrifugation at a setting of 5 in an Eppendorf 541 5C 
microfuge. After discarding the liquid, one pellet was frozen for later purification of plasmid 
DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was resuspended by 

15 vortex mixing, and extraction was allowed to proceed in the dark for 1 5-30 min with ?. 

occasional remixing. Insoluble materials were pelleted by centrifugation at maximum speed 
for 10 min in a microfuge. Absorption spectra of the supernatant fluids were recorded from 
350-550 nm with a Perkin Elmer lambda six spectrophotometer. 



Analysis of isolated clones 

20 Eight of the yellow colonies contained p -carotene indicating that a single gene 

product catalyzes both cyclizations required to form the two P endgroups of the symmetrical 
p -carotene from the symmetrical precursor lycopene. One of the yellow colonies contained a 
pigment with the spectrum characteristic of 6 -carotene, a monocyclic carotenoid with a single 
e endgroup. Unlike the p cyclase, this e-cyclase appears unable to carry out a second 

25 cyclization at the other end of the molecule. 

The observation that e-cyclase is unable to form two cyclic e-endgroups (e.g. the 
bicyclic e-carotene) illuminates the mechanism by which plants can coordinate and control 
the flow of substrate into carotenoids derived from P -carotene versus those derived from a- 
carotene and also can prevent the formation of carotenoids with two e endgroups. 

30 The availability of the A. thaliana gene encoding the e-cyclase enables the directed 

manipulation of plant and algal species for modification of carotenoid content and 
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composition. Through inactivation of the e-cyclase, whether at the gene level by deletion of 
the gene or by insertional inactivation or by reduction of the amount of enzyme formed (by 
such as antisense technology), one may increase the formation of p -carotene and other 
pigments derived from it. Since vitamin A is derived only from carotenoids with p 
5 endgroups, an enhancement of the production of p -carotene versus a -carotene may enhance 
nutritional value of crop plants. Reduction of carotenoids with e-endgroups may also be of 
value in modifying the color properties of crop plants and specific tissues of these plants. 
Alternatively, where production of a -carotene, or pigments such as lutein that are derived 
from a -carotene, is desirable, whether for the color properties, nutritional value or other 

10 reason, one may overexpress the e-cyclase or express it in specific tissues. Wherever 

agronomic value of a crop is related to pigmentation provided by carotenoid pigments the 
directed manipulation of expression of the €-cyclase gene and/or production of the enzyme 
may be of commercial value. 

The predicted amino acid sequence of the A. thaliana e-cyclase enzyme was 

1 5 determined. A comparison of the amino acid sequences of the P- and e-cyclase enzymes of 
Arabidopsis thaliana (Fig. 13) as predicted by the DNA sequence of the respective cDNAs 
(Fig. 4 for the e-cyclase cDNA sequence), indicates that these two enzymes have many 
regions of sequence similarity, but they are only about 37% identical overall at the amino acid 
level. The degree of sequence identity at the DNA base level, only about 50%, is sufficiently 

20 low such that we and others have been unable to detect this gene by hybridization using the P 
cyclase as a probe in DNA gel blot experiments. 
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We claim: 

1 . An isolated and/or purified nucleic acid sequence which encodes for a protein having 
lycopene €-cyclase enzyme activity and has an amino acid sequence which is at least 85% 
identical to one of SEQ ID NOS: 23 or 25-27. 

5 2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid 
sequence of one of SEQ ID NOS: 23 or 25-27. 

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid 
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

10 5. The host cell of claim 4, wherein the host cell is selected from the group consisting of 
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene e-cyclase enzyme activity and 
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23 or 

15 25-27. 

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS : 23 or 25-27. 
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AMENDED CLAIMS 
[received by the International Bureau on 15 November 1999 (15.1 1 .99); 
original claims 1,2,7 and 8 amended; remaining claims unchanged (1 page)] 
1 . An isolated and/or purified nucleic acid sequence which encodes for a protein having 
lycopene e-cyclase enzyme activity and has an amino acid sequence which is at least 85% 
identical to one of SEQ ID NOS: 23, 25 or 26. 

5 2. The nucleic acid sequence of claim 1 , wherein the protein has the amino acid 
sequence of one of SEQ ID NOS: 23, 25 or 26. 

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid 
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

10 5 - The host cell of claim 4, wherein the host cell is selected from the group consisting of 
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene €-cyclase enzyme activity and 
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23, 25 

15 or 26. 

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS: 23, 25 or 26. 



28 

AMENDED SHEET (ARTICLE 19) 



BNSDOCID: <WO 9963055A1_I_> 



WO. 99/63055 




PCT/US99/12121 



1 / 30 



MVA IPP DMAPP i-*. GPP 

ipp J 



CH 7 OPP ppp 



PSY 



PDS 



ZDS 



MTTA 



LCY 




p-carotene 
xanthophylls abscisic acid 



FIGURE 1 



BNSDOCID: <WO 9963055A1. L> 



WO 99/63055 



PCT/US99/12121 



2 / 30 



It 



a-zeacarotene 



^ cycteM 



neurosporene 



I < 



douturas* 



t 



<3esaturas« 



I 



5-caroten 



^ cydaaa 



lycopene 



S 

cydaw 



cydaa* 



(jC" r "" 

" % 3-zeacarotene 
, y-carotene 



^ * o-carotene 

^ e-carotene VK 

a-carotene >0>... r ..^Jj^J 

I *S^S* R-carntona 



p-carotene 

hydroxylase 



j^^-R-OQ 



ct-cryptoxanthin 



R=- 




_ [' " h » » w 



zemoxanthin 

I 
I 

t 

x^.-r.-OQ^ 

lutein 



OH 



no~*^ (J-cryptoxanthin 
' I 

hydroxylase 

zeaxanthin 



OH 



HO 



FIGURE 2 



BNSDOCID:<WO 99R30R*ai i > 



WO 99/63055 ^ ^ PCT/US99/12121 

4/30 



Arabidopsis thaliana epsilon cyclase: 
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1 caccaatgtc tgtttcttct ttatttaatc tcccattgat tcgcctcaga 

51 tctctcgctc tttcgtcttc tttttettct ttccGATTTG CCCATCGTCC 

101 TCTGTCATCG ATTTCACCGA GAAAGTTACC GAATTTTCGT GCTTTCTCTG 

151 GTACCGCTAT GACAGATACT AAAGATGCTG GTATGGATGC TGTTCAGAGA 

201 CGTCTCATGT TTGAGGATGA ATGCATTCTT GTTGATGAAA CTGATCGTGT 

251 TGTGGGGCAT GTCAGCAAGT ATAATTGTCA TCTGATGGAA AATATTGAAG 

301 CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTCGAAG 

3 51 TATGAGTTGC TTCTCCAGCA AACGTCAAAC ACAAAGGTTA CGTTCCCTCT 

401 AGTGTGGACT AACACTTGTT GCAGCCATCC TCTTTACCGT GAATCAGAGC 

451 TTATCCAGGA CAATGCACTA GGTGTGAGGA ATGCTGCACA AAGAAAGCTT 

501 CTCGATGAGC TTGGTATTGT AGCTGAAGAT GTACCAGTCG ATGAGTTCAC 

551 TCCCTTGGGA CGTATGCTGT ACAAGGCTCC TTCTGATGGC AAATGGGGAG 

601 AGCATGAACT TGATTACTTG CTCTTCATCG TGCGAGACGT GAAGGTTCAA 

651 CCAAACCCAG ATGAAGTAGC TGAGATCAAG TATGTGAGCC GGGAAGAGCT 

701 GAAGGAGCTG GTGAAGAAAG CAGATGCAGG TGAGGAAGGT TTGAAACTGT 

751 CACCATGGTT CAGATTGGTG GTGGACAATT TCTTGATGAA GTGGTGGGAT 

801 CATGTTGAGA AAGGAACTTT GGTTGAAGCT ATAGACATGA AAACCATCCA 

851 CAAACTCTGA ACATCTTTTT TTAAAGTTTT TAAATCAATC AACTTTCTCT 

901 TCATCATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT GTGAGACACT 

951 TACAAAACTT CCAAGCACCT CAGGCAATAA TAAAGTTTGC GGCCGC 
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1 CTCGGTAGCT GGCCACAATC GCTATTTGGA ACCTGGCCCG GCGGGAGTCC 
51 GATGCCGCGA TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCCCG 
101 CGTGAACTCC GCCCAGCAGC CCAGCTGTGC ACACGCGCGA CTCCAGTTTA 
151 AGCTCAGGAG CATGCAGATG ACGCTCATGC AGCCCAGCAT CTCAGCCAAT 
201 CTGTCGCGCG CCGAGGACCG CACAGACCAC ATGAGGGGTG CAAGCACCTG 
251 GGCAGGCGGG CAGTCGCAGG ATGAGCTGAT GCTGAAGGAC GAGTGCATCT 
301 TGGTGGATGT TGAGGACAAC ATCACAGGCC ATGCCAGCAA GCTGGAGTGT 
351 CACAAGTTCC TACCACATCA GCCTGCAGGC CTGCTGCACC GGGCCTTCTC 
401 TGTGTTCCTG TTTGACGATC AGGGGCGACT GCTGCTGCAA CAGCGTGCAC 
451 GCTCAAAAAT CACCTTCCCA AGTGTGTGGA CGAACACCTG CTGCAGCCAC 
501 CCTTTACATG GGCAGACCCC AGATGAGGTG GACCAACTAA GCCAGGTGGC 
551 CGACGGAACA GTACCTGGCG CAAAGGCTGC TGCCATCCGC AAGTTGGAGC 
601 ACGAGCTGGG GATACCAGCG CACCAGCTGC CGGCAAGCGC GTTTCG CTTC 
651 CTCACGCGTT TGCACTACTG TGCCGCGGAC GTGCAGCCAG CTGCGACACA 
701 ATCAGCGCTC TGGGGCGAGC ACGAAATGGA CTACATCTTG TTCATCCGGG 
7S1 CCAACGTCAC CTTGGCGCCC AACCCTGACG AGGTGGACGA AGTCAGGTAC 
801 GTGACGCAAG AGGAGCTGCG GCAGATGATG CAGCCGGACA ACGGGCTGCA 
851 ATGGTCGCCG TG GTTT CGCA TCATCGCCGC GCGCTTCCTT GAGCGTTGGt 
901 GGGCTGACCT GGACGCGGCC CTAAACACTG ACAAACACGA GGATTGGGGA 
951 ACGGTGCATCS ACATCAACGA AGCGTGAAAG CAGAAGCTGC AGGATGTGAA 
1001 GACACGTCAT GGGGTGGAAT TG CGTACTTG GCAGCTTCGT ATCTCCTTTT 
1051 TCTGAGACTG AACCTGCAGT CAGGTCCCAC AAGGTCAGGT AAAATGGCTC 
1101 GATAAAATGT ACCGTCACTT TTTGTCGCGT ATACTGAACT CCAAGAGGTC 
1151 AAAAAAAAAA AAAAA 
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1 CTCGGTAGCT GGCCACAATC G CTATTTGG A ACCTGGCCCG GCGGCAGTCC 
51 GATGCCGCGA TGCTTCGTTC GTTGCTCAGA GGCCTCACGC ATATCCCGCG 
101 CGTGAACTCC GCCCAGCAGC CCAGCTGTGC ACACGCGCGA CTCCAGTTTA 
151 AGCTCAGGAG CATGCAGCTG CTTTCCGAGG ACCGCACAGA CCACATGAGG 
201 GGTGCAAGCA CCTGGGCAGG CGGGCAGTCG CAGGATGAGC TGATGCTGAA 
251 GGACGAGTGC ATCTTGGTAG ATGTTGAGGA CAACATCACA GGCCATGCCA 

3 01 G CAAGCTGG A GTGTCACAAG TTCCTACCAC ATCAGCCTGC AGGCCTGCTG 
351 CACCGGGCCT TCTCTGTGTT CCTGTTTGAC GATCAGGGGC GACTGCTGCT 

4 01 GCAACAGCGT GCACGCTCAA AAATCACCTT CCCAAGTGTG TGGACGAACA 
451 CCTGCTGCAG CCACCCTTTA CATGGGCAGA CCCCAGATGA GGTGGACCAA 
501 CTAAGCCAGG TGGCCGACGG AACAGTACCT GGCGCAAAGG CTGCTGCCAT 
551 CCGCAAGTTG GAGCACGAGC TGGGGATACC AGCGCACCAG CTGCCGGCAA 
601 GCGCGTTTCG CTTCCTCACG CGTTTGCACT ACTGTGCCGC GGACGTGCAG 
651 CCAGCTGCGA CACAATCAGC GCTCTGGGGC GAGCACGAAA TGGACTACAT 
701 CTTGTTCATC CGGGCCAACG TCACCTTGGC GCCCAACCCT GACGAGGTGG 
751 ACGAAGTCAG GTACGTGACG CAAGAGGAGC TGCGGCAGAT GATGCAGCCG 
801 GACAACGGGC TTCAATGGTC GCCGTGGTTT CGCATCATCG CCGCGCGCTT 
851 CCTTGAGCGT TGGTGGGCTG ACCTGGACGC GGCCCTAAAC ACTGACAAAC 
901 ACG AGGATTG GGGAACGGTG CATCACATCA ACGAAGCGTG AAGGCAGAAG 
951 CTGCAGGATG TGAAGACACG TCATGGGGTG GAATTGCGTA CTTGGCAGCT 

1001 TCGTATCTCC T T T TT C TGAC ACTGAACCTG CAGAGCTAGA GTCAATGGTG 
1051 CATCATATTC ATCGTCTCTC TTTTGTTTTA GACTAATCTG TAGCTAGAGT 
1101 CACTGATGAA TCCTTTACAA CTTTCAAAAA AAAAA 
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1 

HPo^ MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF KLRSMQMTLM QPSISANtJS 

HP09 MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF KLRSMQLL QPSISAN ^SR 

ATDPT- MSVSSLFNLP . LIRLRSLA. LSSSFSSFRF AHRPLSSIs! PRKLPNFRAP 

C fc>r^- MS - SSMLNFT .ASRIVSLPL LSSPPSRVHL PLCFFSPISL TQRFSAKLTF 

Prro**3 TGPPPRFFP IRSPVPRTQL FVRAFSAV 

cc^J- .-MTADNNSM PHGAVSSYAK LVQNQTPEDI LEEFPEIIPL QQRPN*. ! !lR 

51 

AEDRTDHMRG ASTWAGGQSQ DELMLKDECI LVDVEDMITG HASKLECHKF 
SEDRTDHMRG ASTWAGGQSQ DELMLKDECI LVDVEDNITG HASKLECHKF 
S..GTA.MTD TKDAGMDAVQ RRLMFEDECI LVD ETDRWG HVSKYNCHLM 
SSQATT . MGE WDAGMDAVQ RRLMFEDECI LVD ENDKWG HESKYNCHLM 

T.MTD SNDAGMDAVQ RRLMFEDECI LVDENNRWG HDTKYNCHLM 

SSETSNDESG ETCFSGHDEE QIKLMNENCI VLDWDDNAIG AGTKKVCHLM 

101 150 
LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA RSKITFPSVW TNTCCSHPLH 
LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA RSKITFPSVW TNTCCSHPLH 
ENIEAKNLLH RAFSVFLFNS KYELLLQQRS NTKVTFPLVW TNTCCSHPLY 
EKIESENLLH RAFSVFLFNS KYELLLQQRS ATKVTFPLVW TNTCCSHPLY 
EKIEAENLLH RAFSVFLFNS KYELLLQQRS KTKVTFPLVW TNTCCSHPLY 
ENIE. KGLLH RAFSVFIFNE QGELLLQQRA TEKITFP DLW TNTCCSHPLC 

151 200 

GQTPDEVDQL SQVADGTVPG AKAAAIRKLE HELGIPAHQL PA. SAFRFLT 

GQTPDEVDQL SQVADGTVPG AKAAAIRKLE HELGIPAHQL PA. SAFRFLT 

SELIQDNALG VRNAAQRKLL DELGIVAEDV PV.DEFTPLG 

R E SELIDENCLG VRNAAQRKLL DELGIPAEDL PV.DQFIPLS 

RE S ELI EENVLG VRNAAQRKLF DELGIVAEDV PV.DEFTPLG 

ID . . . DELGL KGKLDDKIKG AITAAVRKLD HELGIPEDET KTRGKFHFLN 

2 °1 250 
RLHYCAADVQ PAATQSALWG EHEMDYILFI .... RANVTL APNPDEVDEV 

RLHYCAADVQ PAATQSALWG EHEMDYILFI RANVTL APNPDEVDEV 

BMLY KAPSDGKWG EHELDYLLFI VRDVKV QPNPDEVAEI 

KAPSDGKWG EHELDYLLFI IRDVNL DPNPDEVAEV 

RMLY KAPSDGKWG EHEVDYLLFI VRDVKL QPNPDEVAEI 

RIHY MAPSNEPWG EHEIDYILFY KINAKENLTV NPNVNEVRDF 

251 300 

RYVTQEELRQ MMQ PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD 

RYVTQEELRQ MMQ PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD 

KYVSREELKE LVKKADAGEE GLKLSPWFRL WDNFLMKWW DHVEKGTLVE 
KYMNRDDLKE LLRKADAEEE GVKLSPWFRL WDNFLFKWW DHVEKGS LKD 
KYVSREELKE LVKKAD AG D E AVKLSPWFRL WDNFLMKWW DHVEKGTITE 
KWVSPNDLKT MF ADP SYKFTPWFKI I CENYLFNWW EQLDDLSEVE 

301 

KHEDWGTVHH INEA* 

KHEDWGTVHH INEA* 

A. IDMKTIHK L* 

A . ADMKTIHK L* 

A. ADMKTIHK L* 

NDRQ. . . IHR ML* 
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1 ==.» aacaa ctc aaat « c «= c9t = gct c« actccgc catgggtg , c 
« 9 actc cggca tggatgctgt teagcgac5t rtca icgatgMtg 
=a« ttggtg gatgagtgtg 9ggaMt ^ 

m .« gt c ictt gatggagaag attgaaa=ag gtaaaatgct g 
ox « cagcg « t ttctatt=aa ttcaaaata= 9agttactt= ^ 

^.cc aaggtgacat tt=ctttagt atggaccMc ^ 
9 c= atccact ctacagagaa tc=gagettg tte=cgaMc 

=1 «gc tgca = a gaggaxscxJat xxxxxxxxxx xxxxxxxxxx xxxxxxxxxx 

xxxxxxxxxx xxxxxxxxxx xioqooo _ 
*~***~ XX ™ xxxx xxx=ooccoq< 
sox ™* xxxxxxx xxxxxxxxxx xxxxxxxxxx xxmacx xxxxxxxxxx 
— ™ *~~™ x* xxxxxxxx xxxxxxxxxx 

"X X X X XXXXXXX ™, XJQQ( ^^^^ 

^ — — ~™~x tc atgtg c aa aagggta = ac 
7 ox e cct gaatg = aat « gata tgaaaaceat acacaagctg atatagaaac 

«x ccgaaaagea agcctaataa ttcgggttgg 

aox == at = aattg ««« tctt ttaacaactt ttaatcteta tttgagcatg 

85X tt 9att =« g fttttrtt »U»tttl 9 ,g t «= g « t= agttgtaa 
•01 t aatgaacca « gatggt « gcaattt=aa gttcctat=g a=atgtagtg 
951 atctaaaaaa 
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Plane beea 



Plane beta 
Consensus 



Plane beta 
A.c. epMilon 
Consensu* 



Plane beca 
A.e. cpmiiaa 
Consensus 



Plane bees 
A.e. opsii on 

Consensus 



Plane beea 
A.e. epsii on 

Consensus 



Plane beea 
A.C. opmilan 
Consensus 



Plane beea 
A.c. cpsiiaa 

Consensus 



*~ ™» S H~! ™? SSSS 

71 Cyancbacterial enzyme begins 1 

Vk.-SsAIXe LVP gnoaw . OFlXPrnYDp. « r« L/ 
VXAGGSB2X. PVCHQOKKBM DKQSIxOSk£ PpifiS^Z 
VX---S--L- -V - o O s 



141 



Possible subunit interaction domain 



DLAWGGGPA 
DKWXCCCPA 
D--V-G-GPA 



CIAVXOQVS1 
CLALAAksxx 
GLA-A 



140 

AGX^VcSXDP 
LCUCVCtlGP 



DLP 
--P 



PXLIWPHM YGVWVDBFHA MDLLOCLOaT WSCa-VYiBH 
»P . . . FTNM YCVWKDBmt) LCX^XCIKHV WRZTTVYLM 
P W YGVW-DBF-- --L-C W VY-00 



4 Omucleotide-binding signature !f 



Conserved regional 



-e-KDt-apr 

D JCP ITICRAY 
R-Y 



GRVHRXQUCS 
GRVS KJU r .Tff 
GRV-a-.L-- 



210 

ntBQKCX-HG 
ST.f .HRCVBSG 
— — -C-— G 



211 
VXFHqaXVlk 
VSYLSSJCVT5S 
V XV-- 



VlHS.B-kSa 
ITSASOGUU. 



llcaDG-ero 

VACDDHHVTP 



AeWLDATCF 
CRLATVASGA 
A-G- 



281 

PFD--KMVfM DWRDaHL-nn 

PYDPDCMVTM DYRDY..TNB 

P-D MVTM D-U) H- 

^ Conserved region #2 

3S1 

-HLCIXVKsI BKDBhCVXPH 

DTLCIRHJCT VMWSTIPV 

--lgi-.--- -a-g — ip- 



421 



SR-.LVOYDK PYnPGY.OVA YGIlABVeeK 
ASGXXMYIV GGPRVCvSlA y££bv££ 
L-QY O-A YG---BV--- 



* *LXX*X* - 1 PTFLYAKPFS SRrlTtSlTS 
.XVRSUUUY PTFLYAMPW KSr£fp££? 
---------- PT7XYAHP-- --R-F-8ST- 



4 Conserved region #3 



££££535 ^ ACC ™» VKPSTGYMVA 

GGSUPHTBQ1C NLAFGAAASM VBPATCYSW 

GG-LP— O- Q--A-H VKF-TOY-V- 

4- 



350 

LVXBJKTLxTad OXQBRMvARI. 

LASXDVKPFD UJCTx£klAI. 

L o ju, 

4- 

„ _ 420 

KTIAAAPWA HAIi-YLgSe 

RSL3BAPKYA SVIABIIJU* 

R-L--AP--A --I--L--B 



Conserved region #4 Predicted TM hetix 



WiiM SSSSSS 5S2SB SSSSSR 

S— -ER-RQR-FF «FG D R-PF- 



3SSS S3S SSSK; SSSJ SUSSSS 

-GFIi*S-I»~» - - L- - F- L- - P----M--B- 



^ Conserved region #5 



410 

AFFOLePrYW 
TFTRXPKWMW 
-FF-L If 



533 
O-e 

KV. 



Predicted TM helix 
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Adonis palaescina e-cyclase cONA »5 Length: 



1898 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 . 
1751 
1801 
1851 



aaaggagtgt 
caaactccat 
gagtatctgg 
ccctgtgtgg 
ataacataca 
gctgatggtg 
tgtggatgaa 
tccaaatgca 
aagttgccac 
a< ?gttgtgga 
ggttgaaagt 
Sgtgtgtggg 
gcatgcttgg 
ttattggtcg 
ttgctgaaaa 
ggaaaggatc 
atgagacctt 
tcagggaaac 
aaccgcttac 
actcaatggt 
tgctcggaag 
aacaagactt 
cattcgatct 
atccaagtta 
tggttcttta 
caagcatggt 
gaagctccaa 
ctctgcgtat 
aagcatggag 
tttcttttTg 
aacattcttt 
tccttgggtc 
atgtttgctt 
gctttcagat 
agtctcatct 
ttcgaatgtg 
taaagtaaat 
aaagtaaatt 



tctattaatg 
tttctcctct 
ctacggaact 
acttttggaa 
tcgatatggt 
gaagcgggag 
gaggatttta 
gcaaacaaag 
caataccttt 
cctgctggtc 
tggc'cttatt 
aagacgagtt 
aaggacacca 
tgcatatgga 
g<?tgtgtgga 
actgaagctg 
tatcccttgc 
ttttggagta 
ggggrggagg 
actcatggac 
aagaatatcc 
ttttttgagg 
actgaagaga 
caaaagttta 
ccaaacacag 
gcatccagca 
aatatgcttc 
gtggtttctg 
cagtctttgg 
gattagagct 
agaaccttct 
ttcactatca 
tggcgccaaa 
ccttctggtg 
attattaaac 
tatatgatca 
gccgggtttg 
tattgataca 



tcactgtcgc 
tctcttcaaa 
acttggtgtt 
caagaaacct 
tcttcttgta 
tagaagttct 
tcaaagctgg 
tctatggaga 
tggagaatcc 
tttcactggc 
ggtcctgatc 
caaagatctt 
tcgtatatct 
cgagttagtc 
gtcaggtgta 
gtgatggcca 
aggcttgcca 
tgaagtaggt 
ttgaggcgga 
tacagagact 
aacatttctC 
aaacctgtct: 
aaactgatgt 
tgaagaggaa 
agcaaaagaa 
acaggctatt 
tgtaattgca 
gacaaagtag 
ccaaaggagc 
tattgtgcag 
tccgcttgcc 
tctttcgatc 
cagcatgagg 
cagttatggt 
tctagtgctt 
tctctatgta 
atattgttgt 
aaagtagttc 



attcttgcaa 
acaacaaacc 
cgcaacctca 
tagcagttca 
gagtagattt 
gttgcttata 
tggttctgag 
aacaggccaa 
gtgatggact 
tgcagaagct 
ttccttttac 
ggacttgaac 
tgataatgat 
gacatttgct 
tcatatctgg 
tagccttgta 
ctgttgcatc 
ggccctcgtg 
gaacaatcca 
atatgcaaca 
tatgtcatgc 
ggcctcaaaa 
cacgattgaa 
tggtcatata 
cctagcattt 
cggttgtacg 
aagattttga 
tgcagtaaac 
gaaaacgtca 
ctagatattg 
aacttggatg 
tcgtcttgtt 
atgtcacttg 
aagagcttac 
caccaaataa 
tatcctgtac 
gtcaaaccgg 
ttttccttaa 



cacttatatt 
aatgtgagca 
tctcttcttg 
aaactagctt 
tcaagtgaga 
aagagggttt 
cttttgtttg 
gctcgccgat 
tggtcgtaat 
gctaagctag 
aaataattat 
gttgtatcga 
gctcctgtcc 
acatgaggag 
attctaaagt 
gtttgtgaaa 
tggagcagct 
tttgtgtcca 
tacgatccca 
gaaattacag 
ccatgtcgcc 
gatgccatgc 
gactctgggt 
ttcctgttgg 
ggtgctgcag 
gtcactgtca 
agcaagataa 
atttcaatgc 
aagagcatTc 
aagcaaccag 
tggtggggtt 
ttccatgtac 
tgagacattt 
ctcgaaaggt 
atgaggatcc 
tctaatctca 
ccaatgatat 
aaaaaaaa 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 



MELLGVRNLI 
SGSRSSVAYK 
IPFGESVMDL 
DEFKDLGLER 
CVESGVSYLD 
LEYEVGGPRV 
EYPTFLYVMP 
KVYEEEWSYI 
YASVIAKILK 

LELIVQLDIE 
APNSMRMSLV 



SSCPVWTFGT 
EGFVDEEDFI 
WIGCGPAGL 
CIEHAWKDTI 
SKVERXTEAG 
CVQTAYGVEV 
MSPTRLFFEE 
PVGGSLPNTE 
QDNSAYWSG 

ATRTFFRTFF 
RHLLSDPSGA 



RNLSSSKLAY 
KAGGSELLFV 
SLAAEAAKLG 
VYLDNDAPVL 
DGHSLWCEN 
EVENNPYDPN 
TCLASKDAMP 
QKNLAFGAAA 
QSSAVNISMQ 

RLPTWMWWGF 
VMVRAYLER* 



NIHRYGSSCR 
CMQQTKSMEK 
LKVGLIGPOL 
IGRAYGRVSR 
EIFIPCRLAT 
LMVFMDYRDY 
FDLLKRKLMS 
SMVHPATGYS 
AWSSLWPKER 

LGSSLSS FDL 



VDFQVRADGG 
QAKLADfCLPP 
PFTNNYGVWE 
HIXHEELLKR 
VASGAASGKL 
MQQKLQCSEE 
RLKTLGIQVT 
WRSLSEAPK 
KRQRAFFLFG 

VLFSMYMFVL 
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DNA sequence of poi 



o cDNA (GenBank R27545) obtained from Nicholas J. Provart 



potato. aeq 
1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 



Length: 1 
tagcggnnnn 
aacatgtttg 
cttattggcc 
gttactcaaa 
tggataggat 
ggtgatgttg 
ctcggggaaa 
aaacagctta 
agcctgatgg 
atctttagaa 
caacacgagt 
ccattcgatc 
cgtaagaatc 
gaggatcttt 
gctagcacgg 
tgaagctcca 
atagcaagaa 
gcttggaaca 
cctatttgga 
catttttccg 
cttggttcaa 
gtttattatt 
tatctgatcc 
agtaaattcc 
actcataaca 
ctaggttaat 
cttctttttt 
gtatatataa 



378 August 
naggatgagt 
gcgggatacc 
gtgcctatgg 
aggtgtgtgg 
tgttgaggcc 
tgattccctg 
ttcttgcagt 
tggagtggaa 
ttttcatgga 
gctaaatatc 
ctttttcgag 
tgttaaagaa 
aaagaaattt 
gccaaataca 
ttcatccagc 
aaatgcgcct 
target, tact 
ctctttggcc 
ctggctctga 
cgcgttcttc 
gtctttcttn 
gcaccaaatg 
tactggtgca 
tcctacaata 
gaaatcgegg 
attgettgaa 
gtccaagatc 
attttataaa 



2, 1996 13:06 Type: 
tcaaagatct tggtcttcaa 
attgtatatc ttgatgatga 
aagagttagt cgccatttac 
aggcaggtgt tttgtatcta 
acaaatggee acagtcttgt 
caggtttgtg actgttgcat 
atgagttggg aggtcctaga 
gttgaggtcg ataacaatcc 
ttatagagac tatgtcagac 
caacatttct etatgecatg 
gaaacttgtt tggcttcaaa 
aaaattgatg ttacgattga 
atgaggagga atggtct-tac 
gaacaaaaaa cacttgeatt 
cacaggttat teagtegtea 
tcgtgcttgc aaatatatta 
agttcaagta ccccgagtat 
acaagaacga aaacgacaaa 
tattgeaget ggatattgag 
cgtgtgccaa aatggatgtg 
agcagacctc atgttatttg 
acatgagaag aggcttaatc 
acattgataa gaacttatct 
gttgttgaan nagaggcetc 
tctctcgagg ccttgtatat 
taagttgcac agtttcagtt 
atgtattgan ccaatttata 
aaaaaaaa 



N Check: 605 
gectgeattg 
tgatcctatt 
tgcacgagga 
aactcgaaag 
agagtgegag 
egggagcage 
gtttctgttc 
atttgacccg 
acgacgctca 
cccatgtctc 
agatgeaatg 
acaccctcgg 
ataccagttg 
tggtgctgct 
gatcactgtc 
cgacaaaatc 
ttcaactcaa 
gatcgttttt 
gggataaggt 
gcagggattt 
ccttctacat 
agacatcttt 
tacattttag 
attacttcag 
aacattttca 
tttgtatctg 
tacattgeca 



poteps.pep Length: 378 TRANSLATE from: 14 to: 1147 

1 DEFKDLGLQA CIEHVWRDTI VYLDDDDPIL IGRAYGRVSR 

51 CVEAGVLYLN SKVDRIVEAT NGHSLVECEG DWIPCRFVT 

101 LQYELGGPRV SVQTAYGVEV EVDNNPFDPS LMVFMDYRDY 

151 KYPTFLYAMP MS PTRVFFEE TCLASKDAMP FDLLKXKLML 

201 EIYEEEWSYI PVGGSLPOTE QKTLAFGAAA SMVHPATGYS 

251 GAFVLANILR QNHSKNMLTS SSTPSISTQA WNTLWPQERK 

301 ALILQLDIEG IRSFFRAFFR VPKWMWQGFL GSSLSXADLM 

351 PNOMRRGLIR HLLSOPTGAT LIRTYLTF* 



HLLHEELLKR 
VASGAASGKF 
VRHDAQSLEA 
RLNTLGVRIK 
WRSLSEAPK 
RQRSFFLFGL 
LFAFYMFXIA 



FIGURE 15A 



Chimeric lettuce/potato lycopene e-cyclase: converts lycopcne to 5-carotene, the lettuce cona 

converts lycopene to e-carotene and the potato cONA does not produce an active enzyme 

(amino acids in lower case are from lettuce and those in uppercase are from the potato cDNA; an Avail site in common to me 
two cDNAs was used to construct the chimera) 



1 meefgarnmt 

51 kcaaksdrcv 

101 eklaqipign 

151 ygvwqdefig 

201 ellrrcvesg 

251 aagkfleyel 

301 QSLEAKYPTF 

351 GVRIKEIYEE 

401 SEAPKCAFVL 

4 51 FLFGLALILQ 

501 MFIIAPN0MR 



atmavftcpr 

vdkqgiavad 

cildlwigc 

lglegciehs 

vsylsskver 

gGPRVSVQTA 

LYAMPMSPTR 

EWSYIPVGGS 

ANILRQNHSK 

LDIEGIRSFF 

RGLIRHLLSD 



ftdcnirhkf 

eedyvkagga 

gpaglalaae 

wkdtlvyldd 

iteapngysl 

YGVEVEVDNN 

VFFEETCLAS 

LPNTEQKTLA 

NMLTSSSTPS 

RAFFRVPKWM 

PTGATLIRTY 



allkqrrftn 
elffvqmqrt 
saklglnvgl 
adpirigray 
iecegnltip 
PFDPSLMVFM 
KDAMPFOLLK 
FGAAASMVHP 
ISTQAWNTLW 
WQGFLGSSLS 
LTF* 



lsassalrqi 

ksmesqskls 

igpdlpftnn 

grvhrdllhe 

crlatvasga 

OYRDYVRHDA 

KKLMLRLNTL 

ATGYSWRSL 

PQERKRQRSF 

XAOLMLFAFY 



FIGURE 15B 
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GAP compact oe Asepsis e-cyclaae * pocaco £ -c y cl ase (pa.cial, 
blosum62.cmp Gap Weiaht- R 

L e ngCh „ eig S t : / rn; er a g e^ smatc Sr" 9 ^ aCCh: 2912 

Ratio: 3 tJg ^ 52* 

Percent Similarity- 79 aoi D « «. ~ P s : 1 

zoo n^vrLccp^vor^^^^^iiii^^i n 14 , 

.50 ^i^i^P^i^^i^^£" " 5 



35 1 LKTYEEEWSYIPVGGSLP1OTEQKNIAFGAAASMVHPATGYSVVRSLSEAP 

2oo raI ^i^i^--n,,,n,£f^ 



400 
249 



401 



KYASyiAKLREETTKQ! ..... KSNISRQAWDTLWPP^KRORAFFLFG 445 
KCArVLANILRQNHSKNMLTSSSTPS I STQAWTLWPQERKRQRS FFLFG 299 
446 ^JVQFDTEGIRSFFRT^ 

4 96 S PNNLRKGLINHLI S DPTGATMIKTYLKV 524 

".-I 111111:1:111 
350 APNDMRRGLIRHLLS D PTGATLIRTYLTF 378 



FIGURE 16 
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Adonis palaestina Ipil 

attcatcttc agcagcgctg tcgtactctt 
taacagtagt cgccgacggt tgaatcggct 
tgggtgaagt cactgatgct ggaatggatg 
ttcgacgacg aatgtatttt ggtggatgag 
tgattccaaa tacaactgtc atttgatgga 
tgcttcacag agccttcagt gttttcttgt 
cttcttcagc aacgatccgc cacaaaggta 
aaacacatgt tgcagtcatc ctctctttcg 
aaaattatct cggtgtacga aacgctgcac 
ctaggcattc cagctgaaga tgtcccagtt 
tcgcattctt tacaaagctc catctgacgg, 
tggactatct cctatttatt gtccgagatg 
gatgaagttg ctgatgctaa gtatgttaat 
actgagaaaa gctgatgctg gtgaagaggg 
ttagattggt tgttgataac tttttgttca 
cagggtacga ttaaggaagt tgctgacatg 
ttaagaggac ttctctcctc tgttctacta 
gtgggtggtg ataagcagtt tttctgtttt 
atttgcctcg atgttgaact tgtaacatat 
gtaagttgaa tttgaggctg aatttatatt 
aa 
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tctatatctt 
attcgcctca 
ctgttcagaa 
aatgacaagg 
aaagatagag 
tcaactcaaa 
acattcccgc 
tgattccgag 
aaagaaagct 
gatgaattta 
caaatgggga 
tgaaatacga 
cgcgaggagt 
actcaagttg 
agtggtggga 
aaaactatcc 
tttgtttttt 
ctttaattta 
ttagacaaat 
tttgggaaca 



cttccatcac 

acgtcaacta 

gcggctcatg 

tcgtcgggca 

gcagaaaatt 

atatgaattg 

tcgtatggac 

ctcatagaag 

tttagacgag 

ctcctcttgg 

gagcacgaat 

tccaaaccca " 

tgagagagat 

tctccttggt 

tcatgtagag 

acaagttgac 

gctacaataa 

tggcttttga 

atgagacctt 

taataatgtt 



FIGURE 17A 



Adonis 

5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
90 
95 
100 
105 
110 
115 
120 
125 



palaestina Ipi2 

ttttaaagct ctttcgctcc accaccatca 
tacaaaagtt aaaaacaccg ctttgggctt 
ccttgtttac gatacgcatc taaaccagta 
tcctaaatta ggcccctttc cggaatcccg 
gattaatcct ttatatagta tcttctccac 
cttcgtgttc ttctcccgct gttcatcttc 
tctatttctc cttccatcac taacagtcct 
gttcgcctca acgtcgacta tgggtgaagt 
ccgtccagaa gcggcttatg ttcgacgatg 
aatgacaagg tcgtcggaca tgattccaaa 
aaagatagag gcagaaaact tgcttcacag 
tcaactcaaa atacgagttg cttcttcagc 
acattcccgc tcgtatggac aaacacctgt 
tgattccgaa ctcatagaag aaaattttct 
aaaggaagct tttagacgag ctaggcattc 
gatgaattca ctcctcttgg tcgcattctt 
aaaatgggga gagcacgaac tggactatct 
tgaaatacga tccaaaccca gatgaagttg 
cgcgaggagt tgaaagagat actgagaaaa 
aataaagttg tctccttggt ttagattggt 
agtggtggga tcatgtagag gaggggaaga 
aaaactatcc acaagttgac ttaagagaaa 
tggtttttgc ttcaataagt ggatggtgat 
ttaattttgg cttttcaatt tgctttatgt 
agtcaaatat gagaccttgt gagttgaatt 
tgggaacata aaaaaaaaaa 



aagccagcca 
tggcccctcc 
attctcggtt 
agaattatgt 
caccactaaa 
agcagcgttg 
cgccgagggt 
cgctgatgct 
aatgtatttt 
tacaactgtc 
agccttcagt 
aacgatctgc 
tgcagccatc 
c 9gggtacga 
cagctgaaga 
tacaaagctc 
tctgtttatt 
ctgacgctaa 
gctgatgcag 
tgtggataac 
ttaaggacgt 
gtctcttaag 
gagcagtttt 
gttgaacttg 
tgaggttata 



aatttctctg 

atatcggaat 

ttaatttgtt 

cgtcgatcag 

acattatcag 

tcgtactctt 

tgaatcggct 

ggtatggatg 

ggtggatgag 

atttgatgga 

gttttcttat 

aacgaaggta 

ccctcttccg 

aacgctgcac 

cgtaccagtt 

catctgacgg 

gtccgagatg 

gtacgttaat 

gtgaagaggg 

tttttgttca 

cgccgacatg 

ttctactatt 

tatgcttcct 

taacatattt 

tttatagttt 



FIGURE 17B 
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WO 99/63055^ 

Haema tococcus pluvia 
ctcggtagct 
gatgccgcga 
cgtgaactcc 
agctcaggag 
ctgtcgcgcg 
ggcaggcggg 
tggtggatgt 
cacaagttcc 
tgtgttcctg 
gctcaaaaat 
cctttacatg 
cgacggaaca 
acgagctggg 
ctcacgcgtt 
atcagcgctc 
ccaacgtcac 
gtgacgcaag 
atggtcgccg 
gggctgacct 
acggtgcatc 
gacacgtcat 
tctgagactg 
gataaaatgt 
aaaaaaaaaa 
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lis Ipil 
ggccacaatc 
tgcttcgttc 
gcccagcagc 
catgcagatg 
ccgaggaccg 
cagtcgcagg 
tgaggacaac 
taccacatca 
tttgacgatc 
caccttccca 
ggcagacccc 
gtacctggcg 
gataccagcg 
tgcactactg 
tggggcgagc 
cttggcgccc 
aggagctgcg 
tggtttcgca 
ggacgcggcc 
acatcaacga 

ggggtggaat 

aacctgcagt 
accgtcactt 
aaaaa 



gccacttgga 

gttgctcaga 

ccagctgtgc 

acgctcatgc 

cacagaccac 

atgagctgat 

atcacaggcc 

gcctgcaggc 

aggggcgact 

agtgtgtgga 

agatgaggtg 

caaaggctgc 

caccagctgc 

tgccgcggac 

acgaaatgga 

aaccctgacg 

gcagatgatg 

tcatcgccgc 

ctaaacactg 

agcgtgaaag 

tgcgtacttg 

caggtcccac 

tttgtcgcgt 



acctggcccg 

ggcctcacgc 

acacgcgcga 

agcccagcat 

atgaggggtg 

gctgaaggac 

atgccagcaa 

ctgctgcacc 

gctgctgcaa 

cgaacacctg 

gaccaactaa 

tgccatccgc 

cggcaagcgc 

gtgcagccag 

ctacatcttg 

aggtggacga 

cagccggaca 

gcgcttcctt 

acaaacacga 

cagaagctgc 

gcagcttcgt 

aaggtcaggt 

atactgaact 



gcggcagtcc 

atatcccccg 

ctccagttta 

ctcagccaat 

caagcacctg 

gagtgcatct 

gctggagtgt 

gggccttctc 

cagcgtgcac 

ctgcagccac 

gccaggtggc 

aagttggagc 

gtttcgcttc 

ctgcgacaca 

ttcatccggg 

agtcaggtac 

acgggctgca 

gagcgttggt 

ggattgggga 

aggatgtgaa 

atctcctttt 

aaaatggctc 

ccaagaggtc 



FIGURE 18A 



Haema tococcus pluvialls IpL2 

tggaacctgg cccggcggca gtccgatgcc 
cagaggcctc acgcatatcc cgcgcgtgaa 
gtgcacacgc gcgactccag tttaagctca 
gaggaccgca cagaccacat gaggggtgca 
gtcgcaggat gagctgatgc tgaaggacga 
acgacaacat cacaggccat gccagcaagc 
ccacatcagc ctgcaggcct gctgcaccgg 
tgacgaccag gggcgactgc tgctgcaaca 
ccttcccaag tgtgtggacg aacacctgct 
cagaccccag atgaggtgga ccaactaagc 
acctggcgca aaagctgctg ccatccgcaa 
taccagcgca ccagctgccg. gcaagcgcgt 
cactactgtg ccgcggacgt gcagccggct 
gggcgagcac gagatggact acatcctatt 
tggcgcccaa ccctgacgag gtggacgaag 
gagctgcggc agatgatgca gccggacaac 
gtttcgcatc accgccgcgc gcttccttga 
acgcggccct aaacactgac aaacacgagg 
atcaacgaag cgtgaaggca gaagctgcag 
<3<3tggaattg cgtacttggc agcttcgtat 
cctgcagagc tagagtcaac ggrgcatcac 
tttagactaa tctgtagcta gagtcactga 
aaaaaaaaa 



gcgatgcttc 

ctccgcccag 

ggagcatgca 

agcacctggg 

gtgcatctta 

tggagtgcca 

gccttctctg 

gcgtgcacgc 

gcagccaccc 

caggtggccg 

gttggagcac 

ttcgcttcct 

gcgacacaat 

catccgggcc 

tcaggtacgt 

gggttgcaat 

gcgttggtgg 

actggggaac 

gatgtgaaga 

ctcctttttc 

attcatcgtc 

tgaacccttt 



gttcgttgct 
cagcccagct 
gctgcttgcc 
caggcgggca 
gtggatgctg 
caaattccta 
tgttcctgtt 
tcaaaaatca 
tctacatggg 
acggcacagt 
gagctgggga 
cacgcgtttg 
cagcgctctg 
aacgtcacct 
gacgcaagag 
ggtcgccgtg 
gctgacctgg 
ggtgcatcac 
cacgtcatgg 
tgagactgaa 
tctcttttgt 
acaaccttca 



FIGURE 18B 
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sativa Ipil 
tgccaaaatg 
cttctccata 
atgccgtctc 
cgctatgggc 
atgacgaatg 
actaaataca 
acacagagca 
ttcagcaacg 
acgtgttgca 
cgcccttggg 
gcatccctgg 
attctataca 
ttacctgctg 
aagtgaaaga 
aggaaggcgg 
attgattgtc 
gaaccctaac 
aaacactaca 
gaaattgaaa 
tcaacttctt 



ttgaaatttc 

ttcttccttc 

tcgcagccgc 

gattccagca 

cattttggtg 

attgtcactt 

ttcagtgtgt 

ttctgcaacc 

gccatccact 

gtgaggaatg 

agcagatgtt 

aggccgcatc 

tttatggtac 

tgtaaaatat 

atgctggtga 

gataatttct 

cgaagctatt 

ctagtaggag 

ttcagatgaa 

tttgctacct 



ccccttttaa 

ttgctgcctc 

tagtgttttc 

tggatgctgt 

gatgagaatg 

gatggagaag 

tcttgttcaa 

aaggtgactt 

atacagggag 

ctgcacagag 

ccggttgatg 

ggatggaaag 

gtgatgttgg 

gtgaaccggg 

agagggtgtg 

tgtttcagtg 

gatatgaaaa 

agaggattat 

tgcttgtatt 

tatcagaaaa 



aaccattgct 

ggaaatcttc 

ctccaccctc 

ccagcgacgt 

acaaagtggt 

attgaaaagg 

ctcgaaatat 

tccctttggt 

agrgagctta 

gaagctcctg 

agttcactcc 

tggggagaac 

tttggatccg 

aagagctgaa 

aagctgtccc 

gtgggatcga 

caatccacaa 

atgagatatt 

tatttctatt 

aaaaa 



accatgatct 

tttccctcca 

tttcgtctgc 

ctcatgttcg 

tggccatgat 

gaaatatgct 

gaattactcc 

atggacaaac 

ttgacgaaaa 

gatgaactcg 

attgggtcgc 

atgaacttga 

aacccagatg 

ggaattggta 

cgtggttcaa 

ctccataagg 

actcacataa 

tgttatatgt 

tggacaaact 



FIGURE 19A 



Laccuca 

5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 
80 
85 
90 
95 



sativa Ipi2 
tattcgcttc 
ggtcttaatc 
tgacgattcc 
aatgcatttt 
cacaattgtc 
agcattcagt: 
aaaggtcaga 
tgcagccatc 
tggggtcaga 
ctgctgaaga 
tacaaggctc 
actcttcctc 
cggacattag 
gcggatgcgg 
ggtggacaac 
tcaatgaagc 
gttaatattt 
tcggtccttc 
tattgtgtac 
aaagaaccac 



aaaatctctt 
tccgccggcg 
ggcatggacg 
ggttgatgaa 
acttgatgga 
gtatttttat 
aaccaaggtg 
cactatacag 
aatgctgcac 
tgttccagtt 
catctgatgg 
gtgcgtgacg 
atacgtgaac 
gtgaggaggg 
ttcttgttca 
aattgacatg 
atggtggtgg 
tttttttaac 
ttgtaacgta 
cgttaattta 



ccattaactg 
cactttcacc 
ctgtccagag 
aatgacaatg 
gaagatcgag 
tcaattcaaa 
acatttcctt 
agaatcggag 
agaggaagct 
gatgagttca 
aaaatggggt 
ttgccgtgaa 
caagaagagt 
tttgaaattg 
aatggtggga 
aaaaccattc 
tttggagcta 
gttttttttt 
ggccctttgg 
aaaaaaaaaa 



ctcaaatctc 
accataaccg 
acgtctcatg 
ttcttgggca 
aaagataatt 
atacgaatta 
tggtatggac 
ttaattcccg 
tctagatgaa 
caactttagg 
gaacatgaag 
cccaaaccct 
taaaagagtt 
tccccatggt 
tcatgtccaa 
ataagttgat 
ataatttgtg 
tttcttttat 
ttacgcttta 
aaaaaaaa 



caccttcgcc 
ccgccatggg 
tttgatgatg 
tgataccaaa 
tgcttcatag 
crccttcagc 
aaacacctgt 
aaaatgccct 
ctcggtatcc 
tcgcatgttg 
ttgattacct 
gatgaggtgg 
actaaggaag 
ttaggctagt 
aaggggacac 
atgaaaaatg 
tgttcaagtc 
tgggagtgtt 
agagtttaat 



FIGURE 19B 
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Chlamydomonas ceinhardcil Ipil 
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FIGURE 20 
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ececta ipil 
ccaaaaacaa 
gactccggca 
cattttggtg 
attgtcactt 
ttcagcgttt 
gtctgcaacc 
gccatccact 
gtaagaaatg 
tgaagatgtt 
aggctccatc 
ttcatagtga 
tatcaaatat 
atgcggggga 
gataacttct 
tgaagcaatt 
tcaaccgaaa 
caattgtttt 
ttcttgtctt 
gaaccatcga 
aaaaaa 



ctcaaatctc 

tggatgctgt 

gatgagtgtg 

gatggagaag 

ttctattcaa 

aaggtgacat 

ctacagagaa 

ctgcacagag 

cccgttgatc 

tgatggaaag 

gagacgttgc 

gtganccang 

ggagggtttg 

tgttcaagtg 

gatatgaaaa 

agttcaagcc 

tttcttttaa 

ttgtgtgtaa 

tggtttgcaa 



ctccgtcgct 

tcagcgacgt: 

acaatgtggt 

attgaaacag 

ttcaaaatac 

ttcctttagt 

tccgagcttg 

gaagctgttg 

agtttactcc 

tggggagaac 

tgtaaacccg 

aagagttaaa 

aagctgtctc 

gtgggatcat 

ccatacacaa 

taataattcg 

gaagttttaa 

gatcttgggt 

tttcaagttc 



cttactccgc 

ctcatgtttg 

gggacatgat 

gtaaaatgct 

gagttacttc 

atggaccaac 

ttcccgaaaa 

gatgaactcg 

tttaggtcgc 

atgaacttga 

aacccagatg 

ggagctgcta 

catggttcag 

gtgcaaaagg 

gctgatatag 

ggttgggtcg 

tctctatttg 

ttcgtttcag 

ctatcgacat 



catgggtgac 

acgatgaatg 

accaaataca 

gcacagagca 

ttcagcaacg 

acctgttgca 

cgcccttgga 

gtatccctgc 

atgctctaca 

ctacctactt 

aagtggcgga 

aggaaagcag 

gttagtggtt 

gtacactcac 

aaacacaccc 

ggtctaccat 

agcatgttga 

ttgtaataat 

gtagtgatct 



FIGURE 21 A 



Oryza satlva Xpil 

cctccctttg 
ggccggcgcc 
agcggctcat 
gttgttggcc 
atctgaaaat 
aatatgaact 
ctagtttgga 
gcttatacag 
tcttggatga 
acccctcttg 
tgaacacgag 
tcccgaaccc 
ctgaaggagc 
gtctccctgg 
atcacgtcga 
cacaagctga 
aagactctgt 
gaagtcagaa 
tgggctctgc 
gcaaacttct 
agaacaatag 
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cctcgcgcag 

gccgccgccg 

gttcgacgac 

atgaatcaaa 

ctacttcata 

cctactccag 

ccaacacttg 

gaaaactacc 

gctgggcatc 

gtcggatgct 

cttgactacc 

ggacgaagtg 

tcatccgcaa 

ttCcggctgg 

gaaaggcacc 

agtaaggact 

tcttgtgctg 

gaagcttttg 

tgactgagag 

atattataca 

catgttccat 



aggcggccgc 

tggaggacgc 

gaatgcattt 

atataactgc 

gggctttcag 

caacgatctg 

ctgcagccat 

ttggtgttag 

ccagctgaag 

ttacaaggcc 

tgctgttcat 

gccgatgtga 

agcggacgcc 

ttgttgacaa 

ctcaacgagg 

gcgatgttgt 

ctgcatatta 

tatgtttctg 

attcccttat 

tgattagtta 

gtttatttgc 



gccttctccg 

cgggatggac 

tggtggatga 
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GAP program of Genetics Computer Group 
blosum62.cmp 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 2728 Length: 530 

R^tio: 5.147 Gaps: 0 

Percent Similarity: 99.623 Percent Identity: 99.057 

Match display thresholds m for the alignment (s) : | - IDENTITY : - 2 - l 

Adonis palaestina e-cyclase »3 x Adonis palaestina E-cyclase #5 

1 MELLGVRNLISSCPVWTFGTRNLsisSKLAYNIHRYGSSCRVDFQVRADGG 50 

N I I I I I ! II I I I M I M I I I I I II I I I I I M It I I I I I I | ! M | | | | | . 
1 MELLGVRNLISSCPVWT FGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 50 

51 SGSRTSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 

e M I I . I I I I I I I I I I I I II I I I I I I I I I | | | | | | | | | M I I I I M I II II 

51 SGSRSSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 

101 IPFGESVMDLWIGCGPAGLSLAAEAAKLGLKVGLIGPDLPFTNNYGVWE 150 

_ H I II I I I I I M I I I I I I I I I I I I I I | I M I I I I I I Ml I I I I I I M I I I 

101 IPFGESVMDLWIGCGPAGLSLAAEAAKLGLKVGLIGPDLPFTNNYGVWE 150 

151 DEFKDLGLERCIEHAWKDTIVYLDNDAPVLIGRAYGRVSRHLLHEELLKR 200 

_ N I I I I I I I I I I II I II I I I I I ! I | | N 1 I 1 I I I | | M M M I I I I I I t I 
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201 CVESGVSYLNSKVERITEAGDGHSLWCENDIFIPCRLATVASGAASGKL 250 
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„ l : N III I III I II l-l I IN I UN | mm in Mill Nil I III I Ml 

351 KVYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSWRSLSEAPK 4 00 

401 YASVIAKILKQDNSAYWSGQSSAVNISMQAWSSLWPKERKRQRAFFLFG 4 50 

m 1 1 n 1 1 1 1 1 1 1 1 1 1 n n 1 1 1 1 n 1 1 1 n n 1 1 1 1 n 1 1 1 1 1 1 1 1 1 1 1 

401 YASVIAKILKQDNSAYWSGQSSAVNISMQAWSSLWPKERKRQRAFFLFG 4 50 
• 

451 LEL I VQLO IEATRT FFRT FFRLPTWMWWGFLGSSLSS FDLVLFSMYMFVL 500 

I M I II M M II I I II M I II II II M I II II II II I II II II I I I I II I 
451 LEL I VQLD IEATRT FFRT FFRLPTWMWWGFLGSSLSS FDLVLFSMYMFVL 500 

501 APNSMRMSLVRHLLSDPSGAVMVKAYLER* 530 

M II I I I I I I I I I I II I I I I M I : I I I || I 
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FIGURE 28 

GAP of-. Arabidopsis epsiion cyclase to Lettuce epsilon cyclase 

Gap Weight: 12 Average Match: 2.9X2 

Length Weight: 4 Average Mismatch: -2.003 

Quality: 1837 Length: 534 

Ratio: 3.499 GaDs ■ » 

Percent Similarity: 76.381 Percent Identity? 6 9. 905 

Match display thresholds for the alignment (s) - 
I = IDENTITY 
: = 2 
. = 1 

Arabidopsis x Lettuce 

1 MECFGARNMTATMAVFTCPRFTDCNIRHKFSLLKQIU15TNLSASSSLRQI 50 
47 SGGGSSGSESCVAVREDFADEEDFVKAGGSEILFVQMQQNKDMDEQSKLV 96 
SI KCSAKSDRCyVDKQGISVADEEDYV^GGSELFFV 10| 
97 D ^PPIS1GDGALDHVVIGCGPAGIJU^SAKLGLKVGLIGPDLPFTNN 146 

- ^u»=*jl^^^ :;: 

i,i M^i^^i^i^^M m m i^im joo 

i S , ^s?r s fir D nTT D Hir^irtf?r' 24s 

247 WWTOBBC^ 

2si ^U^K^iiMn Jimm^ 3oo 

assess: 

397 SEAPKyASyiAE ILREETTKQINS NISRQAWDTLWPPERKRQRAF 441 

401 SEAPNYAAVIAKILRQDQSKEMISLGKYTNISKQAWETLWP 450 
442 T^^^^yyQ^^EGlRSFFRTFFRLPKWIWQGFLGSTLTSGDLVLFALY 491 

4si jwli4j^^^ ;:: 

492 MFVISPNNLRKGLINHLISDPTGATMIKTYLKV* 525 

"H-I--II |: | | : | | | | | | | | : I II : | 
501 MFVIAPHSLRMELVRHLLSDPTGATMVKAYLTI * S34 



6NSDOCID: <WO 99630S5A1_I_> 



WO 99/63055 




PCT/US99/12121 



SEQUENCE LISTING 



<110> CUNNINGHAM JR., FRANCIS X. 
SUN, ZAIREN 

<120> GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM AND 
METHODS OF USE THEREOF 

<130> 8172-9023 

<140> NOT YET ASSIGNED 
<141> 1999-06-02 

<150> 09/088,724 
<151> 1998-06-02 

<150> 09/088,725 
<151> 1998-06-02 

<160> 61 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 1860 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 

<222> (109) . . (1680) 
<400> 1 

ACAAAAGGAA AT AAT TAG AT TCCTCTTTCT GCTTGCTATA CCTTGATAGA ACAATATAAC 60 

AATGGTGTAA GTCTTCTCGC TGTATTCGAA ATTATTTGGA GGAGGAAA ATG GAG TGT 117 

Met Glu Cys 
1 

GTT GGG GCT AGG AAT TTC GCA GCA ATG GCG GTT TCA ACA TTT CCG TCA 165 
Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr Phe Pro Ser 
5 10 15 

TGG AGT TGT CGA AGG AAA TTT CCA GTG GTT AAG AGA TAC AGC TAT AGG 213 
Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr Ser Tyr Arg 
20 25 30 " " 35 

AAT ATT CGT TTC GGT TTG TGT AGT GTC AGA GCT AGC GGC GGC GGA AGT 261 
Asn lie Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly Gly Gly Ser 
40 45 50 

TCC GGT AGT GAG AGT TGT GTA GCG GTG AGA GAA GAT TTC GCT GAC GAA 309 
Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe Ala Asp Glu 
55 60 65 

GAA GAT TTT GTG AAA GCT GGT GGT TCT GAG ATT CTA TTT GTT CAA ATG 357 
Glu Asp Phe Val Lys Ala Gly Gly Ser Glu lie Leu Phe Val Gin Met 
70 75 80 

CAG CAG AAC AAA GAT ATG GAT GAA CAG TCT AAG CTT GTT GAT AAG TTG 4 05 



\ 
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Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val Asp Lys Leu 

85 90 .95 

CCT CCT ATA TCA ATT GGT GAT GGT GCT TTG GAT CAT GTG GTT ATT GGT 4 53 

Pro Pro He Ser He Gly Asp Gly Ala Leu Asp His Val Val He Glv 
100 105 HO us 

TGT GGT CCT GCT GGT TTA GCC TTG GCT GCA GAA TCA GCT AAG CTT GGA 501 
Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Glv 
120 125 130 

TTA AAA GTT GGA CTC ATT GGT CCA GAT CTT CCT TTT ACT AAC AAT TAC 54 9 

Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr 
135 140 145 

GGT GTT TGG GAA GAT GAA TTC AAT GAT CTT GGG CTG CAA AAA TGT ATT 597 
Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin Lys Cys He 
150 155 ■ 160 

GAG CAT GTT TGG AGA GAG ACT ATT GTG TAT CTG GAT GAT GAC AAG CCT 64 5 

Glu His Val Trp Arg Glu Thr He Val Tyr Leu Asp Asp Asp Lys Pro 
165 170 175 

ATT ACC ATT GGC CGT GCT TAT GGA AGA GTT AGT CGA CGT TTG CTC CAT 693 
He Thr He Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His 
180 185 190 195 

GAG GAG CTT TTG AGG AGG TGT GTC GAG TCA GGT GTC TCG TAC CTT AGC 741 
Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser 
200 205 210 

TCG AAA GTT GAC AGC ATA ACA GAA GCT TCT GAT GGC CTT AGA CTT GTT 789 
Ser Lys Val Asp Ser He Thr Glu Ala Ser Asp Gly Leu Arg Leu Val 
215 220 ' 225 

GCT TGT GAC GAC AAT AAC GTC ATT CCC TGC AGG CTT GCC ACT GTT GCT 837 
Ala Cys Asp Asp Asn Asn Val He Pro Cys Arg Leu Ala Thr Val Ala 
230 235 " 240 

TCT GGA GCA GCT TCG GGA AAG CTC TTG CAA TAC GAA GTT GGT GGA CCT 885 
Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val Gly Gly Pro 
245 250 255 

AGA GTC TGT GTG CAA ACT GCA TAC GGC GTG GAG GTT GAG GTG GAA AAT 933 
Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn 
260 265 270 275 

AGT CCA TAT GAT CCA GAT CAA ATG GTT TTC ATG GAT TAC AGA GAT TAT 981 
Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr Arg Asp Tyr 
280 285 290 

ACT AAC GAG AAA GTT CGG AGC TTA GAA GCT GAG TAT CCA ACG TTT CTG 1029 
Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu 
295 300 " 305 

TAC GCC ATG CCT ATG ACA AAG TCA AGA CTC TTC TTC GAG GAG ACA TGT 1077 
Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys 
310 .315 320 

TTG GCC TCA AAA GAT GTC ATG CCC TTT GAT TTG CTA AAA ACG AAG CTC 1125 
Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu 



Z. 
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ATG TTA AGA TTA GAT ACA CTG GGA ATT CGA ATT CTA AAG ACT TAC GAA 
Met Leu Arg Leu Asp Thr Leu Gly lie Arg lie Leu Lys Thr Tyr Glu 
340 345 350 355 

GAG GAG TGG TCC TAT ATC CCA GTT GGT GGT TCC TTG CCA AAC ACC GAA 1221 
Glu Glu Trp Ser Tyr lie Pro Val Gly Gly Ser Leu Pro Asn Thr Glu 
360 365 370 

CAA AAG AAT CTC GCC TTT GGT GCT GCC GCT AGC ATG GTA CAT CCC GCA 12 69 

Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala 
375 380 385 

ACA GGC TAT TCA GTT GTG AGA TCT TTG TCT GAA GCT CCA AAA TAT GCA 1317 
Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala 
390 395 400 

TCA GTC ATC GCA GAG ATA CTA AGA GAA GAG ACT ACC AAA CAG ATC AAC 1365 
Ser Val lie Ala Glu lie Leu Arg Glu Glu Thr Thr Lys Gin lie Asn 
405 410 415 

AGT AAT ATT TCA AGA CAA GCT TGG GAT ACT TTA TGG CCA CCA GAA AGG 1413 
Ser Asn lie Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg 
420 425 430 435 

AAA AGA CAG AGA GCA TTC TTT CTC TTT GGT CTT GCA CTC ATA GTT CAA 14 61 

Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu lie Val Gin 
440 445 450 

TTC GAT ACC GAA GGC ATT AGA AGC TTC TTC CGT ACT TTC TTC CGC CTT 1509 
Phe Asp Thr Glu Gly lie Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu 
4 55 4 60 4 65 

CCA AAA TGG ATG TGG CAA GGG TTT CTA GGA TCA ACA TTA ACA TCA GGA 1557 
Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly 
470 475 480 

GAT CTC GTT CTC TTT GCT TTA TAC ATG TTC GTC ATT TCA CCA AAC AAT 1605 
Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val lie Ser Pro Asn Asn 
485 490 495 

TTG AGA AAA GGT CTC ATC AAT CAT CTC ATC TCT GAT CCA ACC GGA GCA 1653 
Leu Arg Lys Gly Leu lie Asn His Leu lie Ser Asp Pro Thr Gly Ala 
500 " " 505 510 515 

ACC ATG ATA AAA ACC TAT CTC AAA GTA TGATTTACTT ATCAACTCTT 1700 
Thr Met lie Lys Thr Tyr Leu Lys Val 
520 

AGGTTTGTGT AT AT AT ATG T TGATTTATCT G AAT AAT C G A TCAAAGAATG GTATGTGGGT 17 60 

TACTAGGAAG TTGGAAACAA AC ATG TAT AG AATCTAAGGA GTGATCGAAA TGGAGATGGA 1820 

AACGAAAAGA AAAAAATCAG TCTTTGTTTT GTGGTTAGTG 1860 



<210> 2 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
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<4 00> 2 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 
1 5 10. 15 

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 

Ser Tyr Arg Asn He Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Glv 
35 40 45 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu He Leu Phe 
65 70 75 80 

Val Gin Met Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val 
85 -90 " 95 

Asp Lys Leu Pro Pro He Ser He Gly Asp Gly Ala Leu Asp His Val 
100 105 no 

Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 120 125 

Lys Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin 
145 150 155 160 

Lys Cys lie Glu His Val Trp Arg Glu Thr He Val Tyr Leu Asp Asp 
165 170 175 

Asp Lys Pro He Thr He Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 ~ 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 205 

Tyr Leu Ser Ser Lys Val Asp Ser He Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val He Pro Cys Arg Leu Ala 
225 230 235 * " 240 

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu 
260 265 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
3 °5 310 315 320 
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Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
325 330 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly lie Arg lie Leu Lys 
34 0 34 5 350 

Thr Tyr Glu Glu Glu Trp Ser Tyr lie Pro Val Gly Gly Ser Leu Pro 
355 360 365 

Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 375 " 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lys Tyr Ala Ser Val lie Ala Glu lie Leu Arg Glu Glu Thr Thr Lys 
405 410 ~ 415 

Gin lie Asn Ser Asn lie Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro 
420 425 * " 430 

Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe ■ Leu Phe Gly Leu Ala Leu 
435 440 445 

lie Val Gin Phe Asp Thr Glu Gly lie Arg Ser Phe Phe Arg Thr Phe 
450 455 460 

Phe Arg Leu Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 470 475 ** 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val lie Ser 
485 490 495 . 

Pro Asn Asn Leu Arg Lys Gly Leu lie Asn His Leu lie Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met lie Lys Thr Tyr Leu Lys Val 
515 520 

<2-10> 3 
<211> 956 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 3 

GCTCTTTCTC CTCCTCCTCT ACCGATTTCC GACTCCGCCT CCCGAAATCC TTATCCGGAT 60 
TCTCTCCGTC TCTTCGATTT AAACGCTTTT CTGTCTGTTA CGTCGTCGAA GAACGGAGAC 120 
AGAATTCTCC GATTGAGAAC GATGAGAGAC CGGAGAGCAC GAGCTCCACA AACGCTATAG 180 
ACGCTGAGTA TCTGGCGTTG CGTTTGGCGG AGAAATTGGA GAGGAAGAAA TCGGAGAGGT 24 0 

CCACTTATCT AATCGCTGCT ATGTTGTCGA GCTTTGGTAT CACTTCTATG GCTGTTATGG 300 
CTGTTTACTA CAGATTCTCT TGGCAAATGG AGGGAGGTGA GATCTCAATG TTGGAAATGT 360 
TTGGTACATT TGCTCTCTCT GTTGGTGCTG CTGTTGGTAT GGAATTCTGG GCAAGATGGG 420 
CTCATAGAGC TCTGTGGCAC GCTTCTCTAT GGAATATGCA TGAGTCACAT CACAAACCAA 4 80 
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GAGAAGGACC GTTTGAGCTA AACGATGTTT TTGCTATAGT £AACGCTGGT CC AG CG AT TG 54 0 

GTCTCCTCTC TTATGGATTC TTCAATAAAG GACTCGTTCC TGGTCTCTGC TTTGGCGCCG 600 

GGTTAGGCAT AACGGTGTTT GGAATCGCCT ACATGTTTGT CCACGATGGT CTCGTGCACA 660 

AGCGTTTCCC TGTAGGTCCC ATCGCCGACG TCCCTTACCT CCGAAAGGTC GCCGCCGCTC 720 

ACCAGCTACA TCACACAGAC AAGTTCAATG GTGTACCATA TGGACTGTTT CTTGGACCCA 780 

AGGAATTGGA AGAAGTTGGA GGAAATGAAG AG T T AG AT AA GGAGATTAGT CGGAGAATCA 84 0 

AATCATACAA AAAGGCCTCG GGCTCCGGGT CGAGTTCGAG TTCTTGACTT TAAACAAGTT 900 

TTAAATCCCA AATTCTTTTT TTGTCTTCTG TCATTATGAT CATCTTAAGA CGGTCT 956 

<210> 4 

<211> 294 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 4 

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser 
1 5 10 15 

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
20 25 30 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro lie Glu Asn Asp Glu 
35 40 45 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala He Asp Ala Glu Tyr Leu 
50 55 60 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
65 70 75 80 

Thr Tyr Leu He Ala Ala Met Leu Ser Ser Phe Gly He Thr Ser Met 
85 90 ^ 95 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
100 105 HO 

Glu He Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
115 120 125 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
130 135 140 

Trp His Ala Ser Leu Trp Met Asn His Glu Ser His His Lys Pro Arq 
i45 150 155 160 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala He Val Asn Ala Gly 
165 170 175 

Pro Ala He Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
180 185 190 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly He 
195 200 205 
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Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Pro Val 
210 215 220 

Gly Pro lie Ala Asp Val Pro Tyr Leu Arg' Lys Val Ala Ala Ala His 
225 230 235 240 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
245 250 255 

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asp 
260 265 270 

Lys Glu lie Ser Arg Arg lie Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
275 280 285 

Gly Ser Ser Ser Ser Ser 
290 

<210> 5 
<211> 162 
<212> PRT 

<213> Alicalgenes sp. 
<400> 5 

Met Thr Gin Phe Leu lie Val Val Ala Thr Val Leu Val Met Glu Leu 
1 5 10 15 

Thr Ala Tyr Ser Val His Arg Trp lie Met His Gly Pro Leu Gly Trp 
20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 40 45 

Asn Asp Leu Tyr Gly Val Val Phe Ala Val Leu Ala Thr He Leu Phe 
50 55 60 

Thr Val Gly Ala Tyr Trp Trp Pro Val Leu Trp Trp He Ala Leu Gly 
65 70 75 80, 

Met Thr Val Tyr Gly Leu He Tyr Phe He Leu His Asp Gly Leu Val 
85 90 95 

His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Arg Gly Tyr Phe Arg 
100 105 HO 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 120 125 

His Cys Val Ser Phe Gly Phe He Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gin Asp Leu Lys Arg Ser Gly Val Leu Arg Pro Gin Asp Glu Arg 
145 150 155 * 160 

Pro Ser 

<210> 6 
<211> 175 
<212> PRT 

<213> Erwinia herbicola 
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<4 00> 6 

Met Leu Asn Ser Leu He Val He Leu Ser Val He Ala Met Glu Gly 
1 5 10 15 

He Ala Ala Phe Thr His Arg Tyr He Met His Gly Trp Gly Trp Arg 
20 25 30 

Trp His Glu Ser His His Thr Pro Arg Lys Gly Val Phe Glu Leu Asn 
35 40 45 

Asp Leu Phe Ala Val Val Phe Ala Gly Val Ala He Ala Leu He Ala 
50 55 60 

Val Gly Thr Ala Gly Val Trp Pro Leu Gin Trp He Gly Cys Gly Met 
65 70 75 J 80 

Thr Val Tyr Gly Leu Leu Tyr Phe Leu Val His Asp Gly Leu Val His 
85 90 J 95 

Gin Arg Trp Pro Phe His Trp lie Pro Arg Arg Gly Tyr Leu Lys Arg 
100 105 no 

Leu Tyr Val Ala His Arg Leu His His Ala Val Arg Gly Arg Glu Gly 
H5 120 125 

Cys Val Ser Phe Gly Phe He Tyr Ala Arg Lys Pro Ala Asp Leu Gin 
130 135 140 

Ala He Leu Arg Glu Arg His Gly Arg Pro Pro Lys Arg Asp Ala Ala 
145 150 155 160 

Lys Asp Arg Pro Asp Ala Ala Ser Pro Ser Ser Ser Ser Pro Glu 
165 170 175 

<210> 7 
<211> 175 
<212> PRT 

<213> Erwinia uredovora 
<400> 7 

Met Leu Trp He Trp Asn Ala Leu He Val Phe Val Thr Val He Gly 
15 10 15 

Met Glu Val He Ala Ala Leu Ala His Lys Tyr He Met His Glv Trp 
20 25 30 

Gly Trp Gly Trp His Leu Ser His His Glu Pro Arg Lys Gly Ala Phe 
35 40 45 

Glu Val Asn Asp Leu Tyr Ala Val Val Phe Ala Ala Leu Ser lie Leu 
50 55 60 

Leu He Tyr Leu Gly Ser Thr Gly Met Trp Pro Leu Gin Trp lie Gly 
65 70 75 80 

Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met Val His Asp Gly 
85 90 95 

Leu Val His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys -Gly Tyr 
100 105 no 
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Leu Lys Arg Leu Tyr Met Ala His Arg Met His His Ala Val Arg "(Sly 
115 120 125 

Lys Glu Gly Cys Val Ser Phe Gly Phe Leu' Tyr Ala Pro Pro Leu Ser 
130 . 135 140 

Lys Leu Gin Ala Thr Leu Arg Glu Arg His Gly Ala Arg Ala Gly Ala 
145 150 155 160 

Ala Arg Asp Ala Gin Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys 
165 170 " 175 

<210> 8 
<211> 162 
<212> PRT 

<213> Agrobacterium aurianticum 
<400> 8 

Met Thr Asn Phe Leu He Val Val Ala' Thr Val Leu Val Met Glu Leu 
1 5 10 15 

Thr Ala Tyr Ser Val His Arg Trp He Met His Gly Pro Leu Gly Trp 
20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 40 4 5 

Asn Asp Leu Tyr Gly Leu Val Phe Ala Val He Ala Thr Val Leu Phe 
50 55 60 

Thr Val Gly Trp He Trp Ala Pr.o Val Leu Trp Trp lie Ala Leu Gly 
65 70 75 80 

Met Thr Val Tyr Gly Leu lie Tyr Phe Val Leu His Asp Gly Leu Val 
85 90 95 

His Trp Arg Trp Pro Phe Arg Tyr He Pro Arg Lys Gly Tyr Ala Arg 
100 105 110 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 120 125 

His Cys Val Ser Phe Gly Phe lie Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gin Asp Leu Lys Met Ser Gly Val Leu Arg Ala Glu Ala Gin Glu 
145 150 155 160 

Arg Thr 



<210> 9 
<211> 954 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 9 

CCACGGGTCC GCCTCCCCGT TTTTTTCCGA TCCGATCTCC GGTGCCGAGG ACTCAGCTGT 60 
TTGTTCGCGC TTTCTCAGCC GTCACCATGA CCGATTCTAA CGATGCTGGA ATGGATGCTG 120 
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1 1 t AIdAQjACCj 


ACTCATGTTT 


GAAGACGAAT 


GCATTCTCGT 


TGATGAAAAT 


AATCGTGT-GG 


180 


I GUljACA I GA 


CACTAAGTAT 


AACTGTCATC 


TGATGGAAAA 


GATTGAAGCT 


GAGAATTTAC 


240 


<T» rr> /-» tv TV TV /"» 

1 1 CACAGAGC 


TTTCAGTGTG 


TTTTTATTCA 


ACTCCAAGTA 


TGAGTTGCTT 


CTCCAGCAAC 


300 


Kd\j i UAAAAAC 


AAAGGTTACT 


TTCCCACTTG 


TGTGGACAAA 


CACTTGTTGC 


AGCCATCCTC 


360 


nnrprpTv /~»r^/^rp/-> tv 


TV rp /•*» /~» /— » TV y"»/Tr< rp 

ATCCGAGCTT 


ATTGAAGAGA 


ATGTGCTTGG 


TGTAAGAAAT 


GCCGCACAAA 


420 


O VJJ/-V^-\V3 1111 


tort 1 JjAvjU 1 L- 


GG1A1 IG1AG 


TV /•"» TV 7\ r~» T\ rp rp 

CAGAAGATGT 


ACCAGTCGAT 


GAGTTCACTC 


480 




Lnlotl 1 1 ML- 




G 1 1 GGGAA 


ATGGGGAGAG 


CACGAAGTTG 


540 




1 LAlLo 1 G 


CGGG A 1 G I GA 


TV /*"• rp rp x~» TV TV 

AGCTTCAACC 


AAACCCAGAT 


GAAGTGGCTG 


600 


rton 1 1 A 


GG 1 GAGGAGG 


GAAGAGG 11 A 


AGGAGCTGGT 


GAAGAAAGCA 


GATGCTGGCG 


660 




G AAAC 


CCA1 GGI 1 CA 


GATTGGTGGT 


GGATAATTTC 


TTGATGAAGT 


720 


GGTGGGATCA 


TGTTGAGAAA 


GGAACTATCA 


CTGAAGCTGC 


AGACATGAAA 


ACCATTCACA 


780 


AGCTCTGAAC 


TTTCCATAAG 


TTTTGGATCT 


TCCCCTTCCC 


ATAATAAAAT 


TAAGAGATGA 


840 


GACTTTTATT 


GATTACAGAC 


AAAACTGGCA 


ACAAAATCTA 


TTCCTAGGAT 


TTTTTTTTGC 


900 


TTTTTATTTA 


CTTTTGATTC 


ATCTCTAGTT 


TAGTTTTCAT 


CTTAAAAAAA 


AAAA 


954 



<210> 10 
<211> 996 
<212> DNA 

<213> Arabidopsis thaliana 



<400> 10 



CACCAATGTC 


TGTTTCTTCT 


TTATTTAATC 


TCCCATTGAT 


TCGCCTCAGA 


TCTCTCGCTC 


60 


TTTCGTCTTC 


TTTTTCTTCT 


TTCCGATTTG 


CCCATCGTCC 


TCTGTCATCG 


ATTTCACCGA 


120 


GAAAGTTACC 


GAATTTTCGT 


GCTTTCTCTG 


GTACCGCTAT 


GACAGATACT 


AAAGATGCTG 


180 


GTATGGATGC 


TGTTCAGAGA 


CGTCTCATGT 


TTGAGGATGA 


ATGCATTCTT 


GTTGATGAAA 


240 


CTGATCGTGT 


TGTGGGGCAT 


GTCAGCAAGT 


ATAATTGTCA 


TCTGATGGAA 


AATATTGAAG 


300 


CCAAGAATTT 


GCTGCACAGG 


GCTTTTAGTG 


TATTTTTATT 


CAACTCGAAG 


TATGAGTTGC 


360 


TTCTCCAGCA 


AAGGTCAAAC 


ACAAAGGTTA 


CGTTCCCTCT 


AGTGTGGACT 


AACACTTGTT 


420 


GCAGCCATCC 


TCTTTACCGT 


GAATCAGAGC 


TTATCCAGGA 


CAATGCACTA 


GGTGTGAGGA 


480 


ATGCTGCACA 


AAGAAAGCTT 


CTCGATGAGC 


TTGGTATTGT 


AGCTGAAGAT 


GTACCAGTCG 


540 


ATGAGTTCAC 


TCCCTTGGGA 


CGTATGCTGT 


ACAAGGCTCC 


TTCTGATGGC 


AAATGGGGAG 


600 


AGCATGAACT 


TGATTACTTG 


CTCTTCATCG 


TGCGAGACGT 


GAAGGTTCAA 


CCAAACCCAG 


660 


ATGAAGTAGC 


TGAGATCAAG 


TATGTGAGCC 


GGGAAGAGCT 


GAAGGAGCTG 


GTGAAGAAAG 


720 


CAGATGCAGG 


TGAGGAAGGT 


TTGAAACTGT 


CACCATGGTT 


CAGATTGGTG 


GTGGACAATT 


780 
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TCTTGATGAA GTGGTGGGAT CATGTTGAGA AAGGAACTTT GGTTGAAGCT AT AG AC AT G A 84 0 

AAACCATCCA CAAACTCTGA ACATCTTTTT TTAAAGTTTT TAAATCAATC AACTTTCTCT 900 

TCATCATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT GTGAGACACT TACAAAACTT 960 

CCAAGCACCT CAGGCAATAA TAAAGTTTGC GGCCGC 996 

<210> 11 
<211> 1165 

<212> DNA \ 
<213> Haeraatococcus pluvialis 



<400> 11 



CTCGGTAGCT 


GGCCACAATC 


GCTATTTGGA 


ACCTGGCCCG 


GCGGCAGTCC 


GATGCCGCGA 


60 


TGCTTCGTTC 


GTTGCTCAGA 


GGCCTCACGC 


ATATCCCCCG 


CGTGAACTCC 


GCCCAGCAGC 


120 


CCAGCTGTGC 


ACACGCGCGA 


CTCCAGTTTA 


AGCTCAGGAG 


CATGCAGATG 


ACGCTCATGC 


180 


AGCCCAGCAT 


CTCAGCCAAT 


CTGTCGCGCG 


CCGAGGACCG 


CACAGACCAC 


ATGAGGGGTG 


240 


CAAGCACCTG 


GGCAGGCGGG 


CAGTCGCAGG 


ATGAGCTGAT 


GCTGAAGGAC 


GAGTGCATCT 


300 


TGGTGGATGT 


TGAGGACAAC 


ATCACAGGCC 


ATGCCAGCAA 


GCTGGAGTGT 


CACAAGTTCC 


360 


TACCACATCA 


GCCTGCAGGC 


CTGCTGCACC 


GGGCCTTCTC 


TGTGTTCCTG 


TTTGACGATC 


420 


AGGGGCGACT 


GCTGCTGCAA 


CAGCGTGCAC 


GCTCAAAAAT 


CACCTTCCCA 


AGTGTGTGGA 


480 


CGAACACCTG 


CTGCAGCCAC 


CCTTTACATG 


GGCAGACCCC 


AGATGAGGTG 


GACCAACTAA 


540 


GCCAGGTGGC 


CGACGGAACA 


GTACCTGGCG 


CAAAGGCTGC 


TGCCATCCGC 


AAGTTGGAGC 


600 


ACGAGCTGGG 


GATACCAGCG 


CACCAGCTGC 


CGGCAAGCGC 


GTTTCGCTTC 


CTCACGCGTT 


660 


TGCACTACTG 


TGCCGCGGAC 


GTGCAGCCAG 


CTGCGACACA 


ATCAGCGCTC 


TGGGGCGAGC 


720 


ACGAAATGGA 


CTACATCTTG 


TTCATCCGGG 


CCAACGTCAC 


CTTGGCGCCC 


AACCCTGACG 


780 


AGGTGGACGA 


AGTCAGGTAC 


GTGACGCAAG 


AGGAGCTGCG 


GCAGATGATG 


CAGCCGGACA 


840 


ACGGGCTGCA 


ATGGTCGCCG 


TGGTTTCGCA 


TCATCGCCGC 


GCGCTTCCTT 


GAGCGTTGGT 


900 


GGGCTGACCT 


GGACGCGGCC 


CTAAACACTG 


ACAAACACGA 


GGATTGGGGA 


ACGGTGCATC 


960 


ACATCAACGA 


AGCGTGAAAG 


CAGAAGCTGC 


AGGATGTGAA 


GACACGTCAT 


GGGGTGGAAT 


1020 


TGCGTACTTG 


GCAGCTTCGT 


ATCTCCTTTT 


TCTGAGACTG 


AACCTGCAGT 


CAGGTCCCAC 


1080 


AAGGTCAGGT 


AAAATGGCTC 


GATAAAATGT 


ACCGTCACTT 


TTTGTCGCGT 


ATACTGAACT 


1140 


CCAAGAGGTC 


AAAAAAAAAA 


AAAAA 








1165 
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<211> 1135 
<212> DNA 

<213> Haematococcus pluvialis 
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<400> 12 
CTCGGTAGCT 


GGCCACAATC 


GCTATTTGGA 


ACCTGGCCCG 


GCGGCAGTCC 


GATGCCGCGA 


60 


TGCTTCGTTC 


GTTGCTCAGA 


GGCCTCACGC 


ATATCCCGCG 


CGTGAACTCC 


GCCCAGCAGC 


120 


CCAGCTGTGC 


ACACGCGCGA 


CTCCAGTTTA 


AGCTCAGGAG 


CATGCAGCTG 


CTTTCCGAGG 


180 


ACCGCACAGA 


CCACATGAGG 


GGTGCAAGCA 


CCTGGGCAGG 


CGGGCAGTCG 


CAGGATGAGC 


240 


TGATGCTGAA 


GGACGAGTGC 


ATCTTGGTAG 


ATGTTGAGGA 


CAACATCACA 


GGCCATGCCA 


300 


GCAAGCTGGA 


GTGTCACAAG 


TTCCTACCAC 


ATCAGCCTGC 


AGGCCTGCTG 


CACCGGGCCT 


360 


TCTCTGTGTT 


CCTGTTTGAC 


GATCAGGGGC 


GACTGCTGCT 


GCAACAGCGT 


GCACGCTCAA 


420 


AAA I L.ACCTT 


CCCAAGTGTG 


TGGACGAACA 


CCTGCTGCAG 


CCACCCTTTA 


CATGGGCAGA 


480 


CUCL.AGATGA 


GGTGGACCAA 


/*>fT» TV TV TV 

CTAAGCCAGG 


TGGCCGACGG 


AACAGTACCT 


GGCGCAAAGG 


540 


v^I CjC 1 (jCCAT 


CCGCAAGTTG 


/""• TV /""» /T TV TV ^-H 

GAGCACGAGC 


TGGGGATACC 


AGCGCACCAG 


CTGCCGGCAA 


600 


bCCjCGTTTCG 


CTTCCTCACG 


CGTTTGCACT 


ACTGTGCCGC 


GGACGTGCAG 


CCAGCTGCGA 


660 


CACAATCAGC 


GCTCTGGGGC 


GAGCACGAAA 


TGGACTACAT 


CTTGTTCATC 


CGGGCCAACG 


720 


TCACCTTGGC 


GCCCAACCCT 


GACGAGGTGG 


ACGAAGTCAG 


GTACGTGACG 


CAAGAGGAGC 


780 


TGCGGCAGAT 


GATGCAGCCG 


GACAACGGGC 


TTCAATGGTC 


GCCGTGGTTT 


CGCATCATCG 


840 


CCGCGCGCTT 


CCTTGAGCGT 


TGGTGGGCTG 


ACCTGGACGC 


GGCCCTAAAC 


ACTGACAAAC 


900 


ACGAGGATTG 


GGGAACGGTG 


CATCACATCA 


ACGAAGCGTG 


AAGGCAGAAG 




you 


TGAAGACACG 


TCATGGGGTG 


GAATTGCGTA 


CTTGGCAGCT 


TCGTATCTCC 


TTTTTCTGAG 


1020 


ACTGAACCTG 


CAGAGCTAGA 


GTCAATGGTG 


CATCATATTC 


ATCGTCTCTC 


TTTTGTTTTA 


1080 


GACTAATCTG 


TAGCTAGAGT 


CACTGATGAA 


TCCTTTACAA 


CTTTCAAAAA 


AAAAA 


1135 


<210> 13 
<211> 960 
<212> DNA 

<213> Tagetes erecta 












<400> 13 
CCAAAAACAA 


CTCAAATCTC 


CTCCGTCGCT 


CTTACTCCGC 


CATGGGTGAC 


GACTCCGGCA 


60 


TGGATGCTGT 


TCAGCGACGT 


CTCATGTTTG 


ACGATGAATG 


CATTTTGGTG 


GATGAGTGTG 


120 


ACAATG TGGT 


GGGACATGAT 


AC C AAAT AC A 


ATTGTCACTT 


GATGGAGAAG 


AT TG AAAC AG 


180 


GTAAAATGCT 


GCACAGAGCA 


TTCAGCGTTT 


TTCTATTCAA 


T TCAAAAT AC 


GAGTTACTTC 


240 


TTCAGCAACG 


GTCTGCAACC 


AAGGTGACAT 


TTCCTTTAGT 


ATGGACCAAC 


ACCTGTTGCA 


300 


GCCATCCACT 


CTACAGAGAA 


TCCGAGCTTG 


TTCCCGAAAC 


GCCTGAGAGA 


ATGCTGCACA 


360 


GAGGANNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


420 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


480 
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NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


540 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


600 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


660 


NNNNNNNNNN 


NNNNNNNNNN 


TCATGTGCAA 


AAGGGTACAC 


TCACTGAATG 


CAATTTGATA 


720 


TGAAAACCAT 


ACACAAGCTG 


ATATAGAAAC 


ACACCCTCAA 


CCGAAAAGCA 


AGCCTAATAA 


780 


TTCGGGTTGG 


GTCGGGTCTA 


CCATCAATTG 


T T T T ^3P 1* 


TTAACAACTT 


TTAATCTCTA 


840 


TTTGAGCATG 


TTGATTCTTG 


TCTTTTGTGT 


GTAAGATTTT 


GGGTTTCGTT 


TCAGTTGTAA 


900 


TAATGAACCA 


TTGATGGTTT 


GCAATTTCAA 


GTTCCTATCG 


ACATGTAGTG 


ATCTAAAAAA 


960 



<210> 14 
<211> 305 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 14 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His lie Pro Arg Val Asn 
1 5 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30 

Arg Ser Met Gin Met Thr Leu Met Gin Pro Ser lie Ser Ala Asn Leu 
35 40 45 

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 
50 55 60 

Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys Asp Glu Cys lie 
65 70 75 80 

Leu Val Asp Val Glu Asp Asn lie Thr Gly His Ala Ser Lys Leu Glu 
85 90 95 

Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu Leu His Arg Ala 
100 105 110 

Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu Leu Leu Gin Gin 
115 120 125 

Arg Ala Arg Ser Lys lie Thr Phe Pro Ser Val Trp Thr Asn Thr Cys 
130 " 135 140 

Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu Val Asp Gin Leu 
145 150 155 160 

Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala He 
165 170 175 

Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His Gin Leu Pro Ala 
180 185 190 

Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val 
195 200 205 



\1 



m 
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Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu His Glu Met Aso 
210 215 220 

Tyr lie Leu Phe He. Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp 
225 230 235 240 

Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu Leu Arg Gin Met 
245 250 255 

Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp Phe Arg He He 
260 265 " 270 

Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu 
275 280 285 

Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His He Asn Glu 
290 295 300 

Ala 
305 

<210> 15 
<211> 293 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 15 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His He Pro Arg Val Asn 
15 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30. 

Arg Ser Met Gin Leu Leu Ser Glu Asp Arg Thr Asp His Met Arq Glv 
35 40 45 

Ala Ser Thr Trp Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys 
50 55 60 

Asp Glu Cys He Leu Val Asp Val Glu Asp Asn He Thr Glv His Ala 
65 70 75 80 

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu 
85 90 95 

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arq Leu 
100 105 no 

Leu Leu Gin Gin Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp 
115 120 125 

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu 
130 135 140 

Val Asp Gin Leu Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lvs 
145 150 155 160 

Ala Ala Ala He Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His 
165 170 * 175 

Gin Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys 
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180 185 190 

Ala Ala Asp Val Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu 
195 . 200 205 

His Glu Met Asp Tyr lie Leu Phe lie Arg Ala Asn Val Thr Leu Ala 
210 215 220 

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu 
225 230 235 240 

Leu Arg Gin Met Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp 
245 250 255 

Phe Arg lie lie Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu 
260 265 270 

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His 
275 280 285 

His lie Asn Glu Ala 
290 

<210> 16 
<211> 284 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 16 

Met Ser Val Ser Ser Leu Phe Asn Leu Pro Leu lie Arg Leu Arg Ser 
15 10 15 

Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro 
20 25 30 

Leu Ser Ser lie Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser 
35 40 45 

Gly Thr Ala Met Thr Asp Thr Lys Asp Ala Gly Met Asp Ala Val Gin 
50 55 60 

Arg Arg Leu Met Phe Glu Asp Glu Cys lie Leu Val Asp Glu Thr Asp 
65 70 75 80 

Arg Val Val Gly His Val Ser Lys Tyr Asn Cys His Leu Met Glu Asn 
85 90 95 

lie Glu Ala Lys Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe 
100 105 110 

Asn Ser Lys Tyr Glu Leu Leu Leu Gin Gin Arg Ser Asn Thr Lys Val 
115 120 125 

Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr 
130 135 140 

Arg Glu Ser Glu Leu lie Gin Asp Asn Ala Leu Gly Val Arg Asn Ala 
145 150 155 " 160 

Ala Gin Arg Lys Leu Leu Asp Glu Leu Gly lie Val Ala Glu Asp Val 
165 170 175 
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Pro Val Asp Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro 
180 185 190 

Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe He 
195 200 205 

Val Arg Asp Val Lys Val Gin Pro Asn Pro Asp Glu Val Ala Glu He 
210 215 220 

Lys Tyr Val Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp 
225 230 235 240 

Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val 
245 250 * 255 

Asp Asn Phe Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Leu 
260 265 270 

Val Glu Ala He Asp Met Lys Thr He His Lys Leu 
275 280 

<210> 17 
<211> 287 
<212> PRT 

<213> Clarkia breweri 
<400> 17 

Met Ser Ser Ser Met Leu Asn Phe Thr Ala Ser Arg He Val Ser Leu 
1 5 10 15 

Pro Leu Leu Ser Ser Pro Pro Ser Arg Val His Leu Pro Leu Cys Phe 
20 25 30 

Phe Ser Pro He Ser Leu Thr Gin Arg Phe Ser Ala Lys Leu Thr Phe 
35 40 45 

Ser Ser Gin Ala Thr Thr Met Gly Glu Val Val Asp Ala Gly Met Asp 
50 55 60 

Ala Val Gin Arg Arg Leu Met Phe Glu Asp Glu Cys He Leu Val Asp 
65 70 75 80 

Glu Asn Asp Lys Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu 
85 90 95 

Met Glu Lys He Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val 
100 105 ~ 110 

Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gin Gin Arg Ser Ala 
115 120 125 

Thr Lys Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His 
130 135 140 

Pro Leu Tyr Arg Glu Ser Glu Leu He Asp Glu Asn Cys Leu Gly Val 
1^5 150 155 " 160 

Arg Asn Ala Ala Gin Arg Lys Leu Leu Asp Glu Leu Gly He Pro Ala 
165 170 " 175 

Glu Asp Leu Pro Val Asp Gin Phe He Pro Leu Ser Arg He Leu Tyr 
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180 185 190 

Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu 
195 , 200 205 

Leu Phe lie lie Arg Asp Val Asn Leu Asp Pro Asn Pro Asp Glu Val 
210 215 220 

Ala Glu Val Lys Tyr Met Asn Arg Asp Asp Leu Lys Glu Leu Leu Arg 
225 230 235 240 

Lys Ala Asp Ala Glu Glu Glu Gly Val Lys Leu Ser Pro Trp Phe Arg 
245 250 255 

Leu Val Val Asp Asn Phe Leu Phe Lys Trp Trp Asp His Val Glu Lys 
260 265 270 

Gly Ser Leu, Lys Asp Ala Ala Asp Met Lys Thr lie His Lys Leu 
275 280 285 

<210> 18 
<211> 261 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Thr Gly Pro Pro Pro Arg Phe Phe Pro lie Arg Ser Pro Val Pro Arg 
15 10 15 

Thr Gin Leu Phe Val Arg Ala Phe. Ser Ala Val Thr Met Thr Asp Ser 
20 25 30 

Asn Asp Ala Gly Met Asp Ala Val Gin Arg Arg Leu Met Phe Glu Asp 
35 40 45 

Glu Cys lie Leu Val Asp Glu Asn Asn Arg Val Val Gly His Asp Thr 
50 55 60 

Lys Tyr Asn Cys His Leu Met Glu Lys lie Glu Ala Glu Asn Leu Leu 
65 70 75 80 

His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu 
85 90 95 

Leu Gin Gin Arg Ser Lys Thr Lys Val Thr Phe Pro Leu Val Trp Thr 
100 105 110 

Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu lie Glu 
115 120 125 

Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu Phe Asp 
130 135 140 

Glu Leu Gly lie Val Ala Glu Asp Val Pro Val Asp Glu Phe Thr Pro 
145 150 155 160 

Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu 
165 170 ' " 175 

His Glu Val Asp Tyr Leu Leu Phe lie Val Arg Asp Val Lys Leu Gin 
180 185 190 
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Pro Asn Pro Asp Glu Val Ala Glu lie Lys Tyr Val Ser Arg Glu Glu 
195 200 ** .205 

Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp Glu Ala Val Lys' 
210 215 220 

Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Met Lys Trp 
225 230 235 240 

Trp Asp His Val Glu Lys Gly Thr He Thr Glu Ala Ala Asp Met Lys 
245 250 255 

Thr He His Lys Leu 
260 

<210> 19 
<211> 288 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 19 

Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser Tyr 
1 5 10 15 

Ala Lys Leu Val Gin Asn Gin Thr Pro Glu Asp He Leu Glu Glu Phe 
20 25 *" 30 

Pro Glu He He Pro Leu Gin Gin Arg Pro Asn Thr Arg Ser Ser Glu 
35 40 45 

Thr Ser Asn Asp Glu Ser Gly Glu Thr Cys Phe Ser Gly His Asp Glu 
50 55 60 

Glu Gin He Lys Leu Met Asn Glu Asn Cys He Val Leu Asp Trp Asp 
6 5 70 75 80 

Asp Asn Ala He Gly Ala Gly Thr Lys Lys Val Cys His Leu Met Glu 

85 90 95 

Asn He Glu Lys Gly Leu Leu His Arg Ala Phe Ser Val Phe He Phe 
100 105 HO 

Asn Glu Gin Gly Glu Leu Leu Leu Gin Gin Arg Ala Thr Glu Lys He 
115 120 " 125 

Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu Cys 
130 135 140 

He Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys He Lys 
!45 150 155 ^ 160 

Gly Ala He Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu <51y He 
165 170 175 

Pro Glu Asp Glu Thr Lys Thr Arg Gly Lys Phe His Phe Leu Asn Arg 
180 185 190 

He His Tyr Met Ala Pro Ser Asn Glu Pro Trp Gly Glu His Glu He 
195 200 ~ 205 

Asp Tyr He Leu Phe Tyr Lys He Asn Ala Lys Glu Asn Leu Thr Val 
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210 215 220 

Asn Pro Asn Val Asn Glu Val Arg Asp Phe Lys Trp Val Ser Pro Asn 
225 230 235 ' 240 

Asp Leu Lys Thr Met Phe Ala Asp Pro Ser Tyr Lys Phe Thr Pro Trp 
24 5 250 " 255 

Phe Lys lie lie Cys Glu Asn Tyr Leu Phe Asn Trp Trp Glu Gin Leu 
260 v 265 ~ 270 

Asp Asp Leu Ser Glu Val Glu Asn Asp Arg Gin lie His Arg Met Leu 
275 280 285 

<210> 20 
<211> 456 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence of four plant B-cyclases 1 

<400> 20 

Met Asp Thr Leu Leu Lys Thr Pro Asn Leu Glu Phe Leu Pro His Gly 
15 10 15 

Phe Val Lys Ser Phe Ser Lys Phe Gly Lys Cys Glu Gly Val Cys Val 
20 25 " 30 

Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys Glu Asn 
35 40 45 

Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val Val Asp 
50 55 60 

Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val Ala Gin Gin 
65 70 75 80, 

Val Ser Glu Ala Gly Leu Ser Val Cys Ser lie Asp Pro Pro Lys Leu 
85 90 95 

He Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met 
100 105 110 

Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly Ala Val Tyr He 
115 120 125 

Asp Asp Thr Lys Asp Leu Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin 
130 135 140 

Leu Lys Ser Lys Met Met Gin Lys Cys He Asn Gly Val Lys Phe His 
145 150 155 160 

Gin Ala Lys Val He Lys Val He His Glu Glu Lys Ser Met Leu He 
165 170 175 

Cys Asn Asp Gly Thr He Gin Ala Thr Val Val Leu Asp Ala Thr Gly 
180 185 190 

Phe Ser Arg Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin 
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195 200 205 

Val Ala Tyr Gly He Leu Ala Glu Val Glu Glu His Pro Phe Asp Lys 
.210 • 215 220 

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Asn Glu Leu Lys 
225 230 235 240 

Glu Arg Asn Ser He Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser 
245 250 255 

Asn Arg He Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly Leu ■ 
260 265 270 

Arg Met Asp Asp He Gin Glu Arg Met Val Ala Arg Leu His Leu Gly 
275 280 285 

He Lys Val ' Lys Ser He Glu Glu Asp , Glu His Cys Val He Pro Met 
290 295 300 

Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val Gly He Gly Glv 
305 310 315 320 

Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr 
325 330 335 

Leu Ala Ala Ala Pro Val Val Ala Asn Ala He lie Tyr Leu Gly Ser 
340 345 350 

Glu Ser Ser Gly Glu Leu Ser Ala Glu Val Trp Lys Asp Leu Trp Pro 
355 360 365 

He Glu Arg Arg Arg Gin Arg Glu Phe Phe Cys Phe Gly Met Asp He 
370 375 380 

Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg Arg Phe Phe Asp Ala Phe 
385 390 395 " 400 

Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu 
405 410 415 

Phe Leu Pro Glu Leu He Val Phe Gly Leu Ser Leu Phe Ser His Ala 
420 425 430 

Ser Asn Thr Ser Arg Glu He Met Thr Lys Gly Thr Pro Leu Val Met 
435 440 ~ 445 

He Asn Asn Leu Leu Gin Asp Glu 
450 455 

<210> 21 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 21 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 
15 10 15 

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 
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Ser Tyr Arg Asn lie Arg Phe Gly Leu Cys Sfer Val Arg Ala Ser Gly 
35 40 4 5 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu lie Leu Phe 
65 70 75 80 

Val Gin Met Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val 
85 90 95 

Asp Lys Leu Pro Pro lie Ser lie Gly Asp Gly Ala Leu Asp His Val 
100 105 110 

Val lie Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 120 125 

Lys Leu Gly Leu Lys Val Gly Leu lie Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin 
145 150 155 160 

Lys Cys He Glu His Val Trp Arg Glu Thr He Val Tyr Leu Asp Asp 
165 170 " 175 

Asp Lys Pro He Thr He Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 205 

Tyr Leu Ser Ser Lys Val Asp Ser He Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val He Pro Cys Arg Leu Ala 
225 230 235 24 0, 

Thr Val* Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu 
260 265 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
305 310 315 320 

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
325 330 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly lie Arg He Leu Lys 
340 345 ~ 350 

Thr Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro 
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355 360 365 

Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 ■ 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lys Tyr Ala Ser Val lie Ala Glu lie Leu Arg Glu Glu Thr Thr Lys 
405 410 415 

Gin lie Asn Ser Asn lie Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro ■ 
420 425 430 

Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 445 

lie Val Gin Phe Asp Thr Glu Gly lie Arg Ser Phe Phe Arg Thr Phe 
450 455 460 

Phe Arg Leu Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val lie Ser 
485 490 " 495 

Pro Asn Asn Leu Arg Lys Gly Leu lie Asn His Leu He Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met He Lys Thr Tyr Leu Lys Val 
515 520 

<210> 22 
<211> 1898 
<212> DNA 

<213> Adonis palaestina 
<400> 22 

AAAGGAGTGT TCTATTAATG TTACTGTCGC ATTCTTGCAA CACTTATATT CAAACTCCAT 60 
TTTCTTCTTT TCTCTTCAAA ACAACAAACT AATGTGAGCA GAGTATCTGG CTATGGAACT 120 
ACTTGGTGTT CGCAACCTCA TCTCTTCTTG CCCTGTGTGG ACTTTTGGAA CAAGAAACCT 180 
TAGTAGTTCA AAACTAGCTT ATAACATACA TCGATATGGT TCTTCTTGTA GAG T AG ATT T 24 0 

TCAAGTGAGA GCTGATGGTG GAAGCGGGAG TAGAAGTTCT GTTGCTTATA AAGAGGGTTT 300 
TGTGGATGAA GAGGATTTTA TCAAAGCTGG TGGTTCTGAG CTTTTGTTTG TCCAAATGCA 360 
GCAAACAAAG TCTATGGAGA AACAGGCCAA GCTCGCCGAT AAGTTGCCAC CAATACCTTT 420 
TGGAGAATCC GTGATGGACT TGGTTGTAAT AGGTTGTGGA CCTGCTGGTC TTTCACTGGC 4 80 

TGCAGAAGCT GCTAAGCTAG GGTTGAAAGT TGGCCTTATT GGTCCTGATC TTCCTTTTAC 54 0 

AAATAATTAT GGTGTGTGGG AAGACGAGTT CAAAGATCTT GGACTTGAAC GTTGTATCGA 600 
GCATGCTTGG AAGGACACCA TCGTATATCT TGATAATGAT GCTCCTGTCC TTATTGGTCG 660 
TGCATATGGA CGAGTTAGTC GACATTTGCT ACATGAGGAG TTGCTGAAAA GGTGTGTGGA 720 
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GTCAGGTGTA TCATATCTTG ATTCTAAAGT GGAAAGGATC 

TAGCCTTGTA GTTTGTGAAA ATGAGATCTT TATCCCTTGC 

TGGAGCAGCT TCAGGGAAAC TTTTGGAGTA TGAAGTAGGT 

AACCGCTTAT GGGGTGGAGG TTGAGGTGGA GAACAATCCA 

ATTCATGGAC TACAGAGACT ATATGCAACA G AAA T T AC AG 

AACATTTCTC TATGTCATGC CCATGTCGCC AACAAGACTT 

GGCCTCAAAA GATGCCATGC CATTCGATCT ACTGAAGAGA 

GACTCTGGGT ATCCAAGTTA CAAAAGTTTA TGAAGAGGAA 

TGGTTCTTTA CCAAACACAG AGCAAAAGAA CCTAGCATTT 

GCATCCAGCA ACAGGCTATT CGGTTGTACG GTCACTGTCA 

TGTAATTGCA AAGATTTTGA AGCAAGATAA CTCTGCGTAT 

TGCAGTAAAC ATTTCAATGC AAGCATGGAG CAGTCTTTGG 

AAGAGCATTC TTTCTTTTTG GATTAGAGCT TATTGTGCAG 

AACATTCTTT AGAACCTTCT TCCGCTTGCC AACTTGGATG 

TTCACTATCA TCTTTCGATC TCGTCTTGTT TTCCATGTAC 

CAGCATGAGG ATGTCACTTG TGAGACATT.T GCTTTCAGAT 

AAGAGCTTAC CTCGAAAGGT AGTCTCATCT ATTATTAAAC 

ATGAGGATCC TTCGAATGTG TATATGATCA TCTCTATGTA 

TAAAGTAAAT GCCGGGTTTG ATATTGTTGT GTCAAACCGG 

TATTGATACA AAAGTAGTTT TTTTCCTTAA AAAAAAAA 

<210> 23 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 23 

Met Glu Leu Leu Gly Val Arg Asn Leu lie Ser 
1 5 10 

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys 
20 25 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe 
35 " 40 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr 
50 55 

Asp Glu Glu Asp Phe He Lys Ala Gly Gly Ser 
65 70 75 



PCTAJS99/12121 



ACTGAAGCTG 
AGGCTTGCTA 
GGCCCTCGTG 
TACGATCCCA 
TGCTCGGAAG 
TTTTTTGAGG 
AAACTGATGT 
TGGTCATATA 
GGTGCTGCAG 
GAAGCTCCAA 
GTGGTTTCTG 
CCAAAGGAGC 
CTAGATATTG 
TGGTGGGGTT 
ATGTTTGTTT 
CCTTCTGGTG 
TCTAGTGTTT 
TATCCTGTAC 
CCAAT GAT AT 



GTGATGGCCA 
CTGTTGCATC 
TTTGTGTCCA 
ACTTAATGGT 
AAGAATATCC 
AAACCTGTTT 
CACGATTGAA 
TTCCTGTTGG 
CAAGCATGGT 
AATATGCTTC 
GACAAAGTAG 
GAAAACGTCA 
AAGCAACCAG 
TCCTTGGGTC 
TGGCGCCAAA 
CAGTTATGGT 
CACCAAATAA 
TCTAATCTCA 
AAAGTAAATT 



Ser Cys Pro Val Trp 

15^ 

Leu Ala Tyr Asn He 
30 

Gin Val Arg Ala Asp 
45 

Lys Glu Gly Phe Val 
60 

Glu Leu Leu Phe Val 
80 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
18 98 
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Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
85 90 95 

Lys Leu Pro Pro lie- Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
100 105 no 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 160 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asp Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 

Leu Val Val Cys Glu Asn Glu He Phe He Pro Cys Arg Leu Ala Thr 
225 230 235 " 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys Val 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 - 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 390 395 400 

Tyr Ala Ser Val lie Ala Lys lie Leu Lys Gin Asp Asn Ser Ala Tyr 
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405 410 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn lie Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu He Val Gin Leu Asp He Glu Ala Thr Arg Thr 
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val. Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu 
515 520 525 

Arg 

<210> 24 
<211> 1370 
<212> DNA 
<213> Potato 

<400> 24 

TAGCGGAGGA TGAGTTCAAA GATCTTGGTC TTCAAGCCTG CATTGAACAT GTTTGGCTGG 60 

GATACCATTG TATATCTTGA TGATGATGAT CCTATTCTTA TTGGCCGTGC CTATGGAAGA 120 

GTTAGTCGCC ATTTACTGCA CGAGGAGTTA CTCAAAAGGT GTGTGGAGGC AGGTGTTTTG 180 

TATCTAAACT CGAAAGTGGA TAGGATTGTT GAGGCCACAA ATGGCCACAG TCTTGTAGAG 240 

TGCGAGGGTG ATGTTGTGAT TCCCTGCAGG TTTGTGACTG TTGCATCGGG AGCAGCCTCG 300 

GGGAAATTCT TGCAGTATGA GTTGGGAGGT CCTAGAGTTT CTGTTCAAAC AGCTTATGGA 360 

GTGGAAGTTG AGGTCGATAA CAATCCATTT GACCCGAGCC TGATGGTTTT CATGGATTAT 4 20 

AGAGACTATG TCAGACACGA CGCTCAATCT TTAGAAGCTA AATATCCAAC ATTTCTCTAT 4 80 

GCCATGCCCA TGTCTCCAAC ACGAGTCTTT TTCGAGGAAA CTTGTTTGGC TTCAAAAGAT 54 0 

GCAATGCCAT TCGATCTGTT AAAGAAAAAA TTGATGTTAC GATTGAACAC CCTCGGTGTA 600 

AGAATTAAAG AAATTTATGA GGAGGAATGG TCTTACATAC CAGTTGGAGG ATCTTTGCCA 660 

AATACAGAAC AAAAAACACT TGCATTTGGT GCTGCTGCTA GCATGGTTCA TCCAGCCACA 720 

GGTTATTCAG TCGTCAGATC ACTGTCTGAA GCTCCAAAAT GCGCCTTCGT GCTTGCAAAT 7 80 

ATATTACGAC AAAATCATAG CAAGAATATG CTTACTAGTT CAAGTACCCC GAGTATTTCA 840 

ACTCAAGCTT GGAACACTCT TTGGCCACAA GAACGAAAAC GACAAAGATC GTTTTTCCTA 900 
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TTTGGACTGG CTCTGATATT GCAGCTGGAT ATTGAGGGGA 

TTCTTCCGTG TGCCAAAATG GATGTGGCAG GGATTTCTTG 

ACCTCATGTT ATTTGCCTTC TACATGTTTA .TTATTGCACC 

TAATCAGACA TCTTTTATCT GATCCTACTG GTGCAACATT 

TTTAGAGTAA ATTCCTCCTA CAATAGTTGT TGAAAGAGGC 

ACAGAAATCG CGGTCTCTCG AGGCCTTGTA TATAACATTT 

GAATAAGTTG CACAGTTTCA GTTTTTGTAT CTGCTTCTTT 

GACCAATTTA TATACATTGC CAGTATATAT AAATTTTATA 

<210> 25 

<211> 377 

<212> PRT 

<213> Potato 

<400> 25 

Asp Glu Phe Lys Asp Leu Gly Leu Gin Ala Cys 
15 10 

Arg Asp Thr He Val.Tyr Leu Asp Asp Asp Asp 
20 25 

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu 
35 40 

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu 
50 55 

Arg He Val Glu Ala Thr Asn Gly His Ser Leu 
65 70 75 

Asp Val Val He Pro Cys Arg Phe Val Thr Val 
85 90 

Ser Gly Lys Phe Leu Gin Tyr Glu Leu Gly Gly 
100 - 105 

Gin Thr Ala Tyr Gly Val Glu Val Glu Val Asp 
115 120 

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp 
130 135 

Ala Gin Ser Leu Glu Ala Lys Tyr Pro Thr Phe 
145 150 155 

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr 
165 170 

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys 
180 185 

Asn Thr Leu Gly Val Arg He Lys Glu He Tyr 
195 200 



PCT/US99/12I21 



TAAGGTCATT TTTCCGCGCG 960 

GTTCAAGTCT TTCTTAGCAG 1020 

AAATGACATG AGAAGAGGCT 1080 

GATAAGAACT TATCTTACAT 1140 

CTCATTACTT CAGATTCATA 1200 

TCACTAGGTT AATATTGCTT 1260 

TTTGTCCAAG ATCATGTATT 1320 

AAAAAAAAAA 1370 



He Glu His 

Pro He Leu 
30 

His Glu Glu 
45 

Asn Ser Lys 
60 

Val Glu Cys 

Ala Ser Gly 

Pro Arg Val 
110 

Asn Asn Pro 
125 

Tyr Val Arg 
140 

Leu Tyr Ala 
Cys Leu Ala 



Leu Met Leu 
190 

Glu Glu Glu 
205 



Val Trp 
15 

He Gly 
Leu Leu 
Val Asp 



Glu Gly 
80 

Ala Ala' 
95 

Ser Val 



Phe Asp 

His Asp 

Met Pro 
160 

Ser Lys 
175 

Arg Leu 
Trp Ser 
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Tyr 



He 
210 



Pro Val Gly Gly 



Ser 
215 



Leu Pro Asn Thr Glu Gin Lys Thr Leu 
220 



Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser 
225 230 235 240 

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala 
245 250 255 

Asn He Leu Arg Gin Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser 
260 265 270 

Thr Pro Ser He Ser Thr Gin Ala Trp Asn Thr Leu Trp Pro Gin Glu 
275 280 285 

Arg Lys Arg Gin Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu He Leu 
290 295 300 

Gin Leu Asp He Glu Gly He Arg Ser Phe Phe Arg Ala Phe Phe Arg 
305 310 315 320 

Val Pro Lys Met Met Trp Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala 
325 330 335 

Asp Leu Met Leu Phe Ala Phe Tyr Met Phe He He Ala Pro Asn Asp 
340 345 350 

Met Arg Arg Gly Leu He Arg His Leu Leu Ser Asp Pro Thr Gly Ala 
355 360 365 

Thr Leu He Arg Thr Tyr Leu Thr Phe 
370 375 

<210> 26 
<211> 533 
<212> PRT 

<213> Chimeric lettuce/potato 
<400> 26 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe 
15 10 15 

Thr Cys Pro Arg Phe Thr Asp Cys Asn He Arg His Lys Phe Ser Leu 
20 25 30 

Leu Lys Gly Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg 
35 40 45 

Gin He Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gin 
50 " 55 60 

Gly He Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser 
65 70 75 80 

Glu Leu Phe Phe Val Gin Met Gin Arg Thr Lys Ser Met Glu Ser Gin 
85 90 95 

Ser Lys Leu Ser Glu Lys Leu Ala Gin lie Pro He Gly Asn Cys He 
100 105 110 

Leu Asp Leu Val Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala 
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115 120 125 

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu He Gly Pro Asp 
130 . 135 140 

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gin Asp Glu Phe He Glv 
145 150 155 160 

Leu Gly Leu Glu Gly Cys He Glu His Ser Trp Lys Asp Thr Leu Val 
165 170 175 

Tyr Leu Asp Asp Ala Asp Pro He Arg He Gly Arg Ala Tyr Gly Arq* 
180 185 " 190 

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu 
195 200 205 

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg He Thr Glu Ala 
210 215 220 

Pro Asn Gly Tyr Ser Leu He Glu Cys Glu Gly Asn He Thr He Pro 
225 230 235 240 

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu 
245 250 255 

Glu Tyr Glu Leu Gly Gly Pro Arg Val Ser Val Gin Thr Ala Tyr Gly 
260 265 270 

Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp Pro Ser Leu Met Val 
275 280 * 285 

Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp Ala Gin Ser Leu Glu 
290 295 300 

Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro Met Ser Pro Thr Arq 
305 310 315 320 

Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe 
325 330 ' 335 

Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu Asn Thr Leu Gly Val 
340 345 350 

Arg He Lys Glu He Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly 
355 360 365 

Gly Ser Leu Pro Asn Thr Glu Gin Lys Thr Leu Ala Phe <31y Ala Ala 
370 375 380 

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arq Ser Leu 
385 390 395 400 

Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala Asn He Leu Arg Gin 
405 410 415 

Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser Thr Pro Ser He Ser 
420 425 430 

Thr Gin Ala Trp Asn Thr Leu Trp Pro Gin Glu Arg Lys Arq Gin Arq 
435 440 445 
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Ser 



Phe Phe 
450 



Leu 



Phe Gly 



Leu Ala Leu lie Leu Gin Leu Asp lie Glu 
455 460 



Gly 
465 



lie Arg 



Ser 



Phe, Phe 
470 



Arg Ala Phe Phe Arg Val Pro Lys Trp Met 
475 * 480 



Trp Gin Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala Asp Leu Met Leu 
485 490 495 

Phe Ala Phe Tyr Met Phe lie lie Ala Pro Asn Asp Met Arg Arg Gly 
500 505 ^ 510 

Leu lie Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Leu He Arg 
515 520 ' 525 

Thr Tyr Leu Thr Phe 
530 

<210> 27 
<211> 374 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 27 

Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin Lys Cys He Glu His Val 
1 5 10 15 

Trp Arg Glu Thr He Val Tyr Leu Asp Asp Asp Lys Pro He Thr lie 
20 25 30 

Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His Glu Glu Leu 
35 40 45 

Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser Ser Lys Val 
50 55 60 

Asp Ser He Thr Glu Ala Ser Asp Gly Leu Arg Leu Val Ala Cys Asp 
65 70 75 80 

Asp Asn Asn Val He Pro Cys Arg Leu Ala Thr Val Ala Ser Gly Ala 
85 90 95 

Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val Gly Gly Pro Arg Val Cys 
100 105 " 110 

Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn Ser Pro Tyr 
115 120 125 

Asp Pro Asp Gin Met Val Phe Met Asp Tyr Arg Asp Tyr Thr Asn Glu 
130 135 140 

Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu Tyr Ala Met 
145 150 155 160 

Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys Leu Ala Ser 
165 170 175 

Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu Met Leu Arg 
180 185 190 

Leu Asp Thr Leu Gly He Arg lie Leu Lys Thr Tyr Glu Glu Glu Trp 
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195 200 205 

Ser Tyr lie Pro Val Gly Gly Ser Leu Pro'Asn Thr Glu Gin Lys Asn 
210 215 220 

Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Glv Tvr 
225 230 235 Y 240 

Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala Ser Val He 
245 250 ^ 255 

Ala Glu He Leu Arg Glu Glu Thr Thr Lys Gin He Asn Ser Asn He 
260 265 270 

Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg Lys Arg Gin 
275 280 285 

Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu He Val Gin Phe Asp Thr 
2 $0 295 300 

Glu Gly He Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp 
305 310 315 320 

Met Trp Gin Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly Asp Leu Val 
325 330 335 

Leu Phe Ala Leu Tyr Met Phe Val He Ser Pro Asn Asn Leu Arq Lys 
340 345 350 

Gly Leu He Asn His Leu lie Ser Asp Pro Thr Gly Ala Thr Met He 
355 360 365 

Lys Thr Tyr Leu Lys Val 
370 

<210> 28 
<211> 1002 
<212> DNA 

<213> Adonis palaestina 
<400> 28 

ATTCATCTTC AGCAGCGCTG TCGTACTCTT TCTATATCTT CTTCCATCAC T AAC AG TAG T 60 

CGCCGACGGT TGAATCGGCT ATTCGCCTCA ACGTCAACTA TGGGTGAAGT CACTGATGCT 120 

GGAATGGATG CTGTTCAGAA GCGGCTCATG TTCGACGACG AATGTATTTT GGTGGATGAG 180 

AATGACAAGG TCGTCGGGCA TGATTCCAAA TACAACTGTC ATTTGATGGA AAAGATAGAG 240 

GCAGAAAATT TGCTTCACAG AGCCTTCAGT GTTTTCTTGT TC AAC TC AAA ATATGAATTG 300 

CTTCTTCAGC AACGATCCGC CACAAAGGTA ACATTCCCGC TCGTATGGAC AAACACATGT 360 

TGCAGTCATC CTCTCTTTCG TGATTCCGAG CTCATAGAAG AAAATTATCT CGGTGTACGA 420 

AACGCTGCAC AAAGAAAGCT TTTAGACGAG CTAGGCATTC CAGCTGAAGA TGTCCCAGTT 4 80 

GATGAATTTA CTCCTCTTGG TCGCATTCTT TACAAAGCTC CATCTGACGG CAAATGGGGA 540 

GAGCACGAAT TGGACTATCT CCTATTTATT GTCCGAGATG TGAAATACGA TCCAAACCCA 600 
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GATGAAGTTG CTGATGCTAA GTATGTTAAT CGCGAGGAGT TGAGAGAGAT ACTGAGAAAA 660 

GCTGATGCTG GTGAAGAGGG ACTCAAGTTG TCTCCTTGGT TTAGATTGGT TGTTGATAAC 

TTTTTGTTCA AGTGGTGGGA TCATGTAGAG CAGGGTACGA TTAAGGAAGT TGCTGACATG 

AAAACTATCC ACAAGTTGAC TTAAGAGGAC TTCTCTCCTC TGTTCTACTA TTTGTTTTTT 

GCTACAATAA GTGGGTGGTG ATAAGCAGTT TTTCTGTTTT CTTTAATTTA TGGCTTTTGA 

ATTTGCCTCG ATGTTGAACT TGTAACATAT TTAGACAAAT ATGAGACCTT GTAAGTTGAA 

TTTGAGGCTG AATTTATATT TTTGGGAACA TAATAATGTT AA 

<210> 29 
<211> 1271 
<212> DNA 

<213> Adonis palaestina 
<400> 29 



720 
780 
840 
900 
960 
1002 



TTTTAAAGCT 


CTTTCGCTCC ACCACCATCA AAGCCAGCCA 


AATTTCTCTG 


TACAAAAGTT 


60 


AAAAACACCG 


CTTTGGGCTT 


TGGCCCCTCC 


ATATCGGAAT 


CCTTGTTTAC 


GATACGCATC 


120 


TAAACCAGTA 


ATTCTCGGTT 


TTAATTTGTT 


TCCTAAATTA 


GGCCCCTTTC 


CGGAATCCCG 


180 


AGAATTATGT 


CGTCGATCAG 


GATTAATCCT 


TTATATAGTA 


TCTTCTCCAC 


CACCACTAAA 


240 


ACATTATCAG 


CTTCGTGTTC 


TTCTCCCGCT 


GTTCATCTTC 


AGCAGCGTTG 


TACGTACTCT 


300 


TTCTATTTCT 


TCTTCCATCA 


CTAACAGTCC 


TCGCCGAGGG TTGAATCGGC 


TGTTCGCCTC 


360 


AACGTCGACT 


ATGGGTGAAG 


TCGCTGATGC 


TGGTATGGAT 


GCCGTCCAGA 


AGCGGCTTAT 


420 


GTTCGACGAT 


GAATGTATTT 


TGGTGGATGA 


GAATGACAAG 


GTCGTCGGAC 


ATGATTCCAA 


480 


ATACAACTGT 


CATTTGATGG 


AAAAGATAGA 


GGCAGAAAAC 


TTGCTTCACA 


GAGCCTTCAG 


54 0 


TGTTTTCTTA 


TTCAACTCAA 


AATACGAGTT 


GCTTCTTCAG 


CAACGATCTG 


CAACGAAGGT 


600 


AACATTCCCG 


CTCGTATGGA 


CAAACACCTG 


TTGCAGCCAT 


CCCCTCTTCC 


GTGATTCCGA 


660 


ACTCATAGAA 


GAAAATTTTC 


TCGGGGTACG 


AAACGCTGCA 


CAAAGGAAGC 


TTTTAGACGA 


720 


GCTAGGCATT 


CCAGCTGAAG 


ACGTACCAGT 


TGATGAATTC 


ACTCCTCTTG 


GTCGCATTCT 


780 


TTACAAAGCT 


CCATCTGACG 


GAAAATGGGG 


AGAGCACGAA 


CTGGACTATC 


TTCTGTTTAT 


840 


TGTCCGAGAT 


GTGAAATACG 


ATCCAAACCC AGATGAAGTT GCTGACGCTA AGTACGTTAA 


900 


TCGCGAGGAG 


TTGAAAGAGA 


TACTGAGAAA AGCTGATGCA GGTGAAGAGG 


GAATAAAGTT 


960 


GTCTCCTTGG 


TTTAGATTGG 


TTGTGGATAA 


CTTTTTGTTC 


AAGTGGTGGG 


AT CAT GT AG A 


1020 


GGAGGGGAAG ATTAAGGACG 


TCGCCGACAT 


GAAAACTATC 


CACAAGTTGA 


CTTAAGAGAA 


1080 


AGTCTCTTAA GTTCTACTAT 


TTGGTTTTTG 


CTTCAATAAG 


TGGATGGTGA 


TGAGCAGTTT 


1140 


TTATGCTTCC 


TTTAATTf TG 


GCTTTTCAAT 


TTGCTTTATG 


TGTTGAACTT 


GTAACATATT 


1200 


TAGTCAAATA 


TGAGACCTTG 


TGAGTTGAAT 


TTGAGGTTAT 


ATTTATAGTT 


TTGGGAACAT 


1260 
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AAAAAAAAaA A 1271 

<210> 30 
.<211> 1109 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 30 



TGGAAPPTPP 




L 1 LLLA1 LjLL 


LCGATGCTTC 


GTTCGTTGCT 


CAGAGGCCTC 


60 


ACGCATATCC 


PPPPPPTPAA 


PTPPPPr*r*2ir* 


LALLLLALL I 


b 1 GCACACGC 


GCGACTCCAG 


120 


TTTAAGCTCA 


GGAPPATPPA 


PPTPPTTPPP 


n a r* r* a cnc c z\ 


UAvjaCCACAT 


GAGGGGTGCA 


180 


AGCACCTGGG 


PAPPPPPPPA 

r± \j \j v_» vj u vj V- r\ 


PTPPPAPP AT 


p zx rr tp zi tc r* 

X VjA 1 LL 


1 IdAAvjGACGA 


GTGCATCTTA 


240 


GTGGATGCTG 


ACGACAACAT 


PAPAPPPPAT 


PPPAPPA APP 


1 LjLjALj 1 IjLL A 


CAaaTTCCTA 


300 


CCACATCAGC 


PTPPAPPPPT 


PPTPPAPPPP 


PPPTTPTPTP 


lull CL 1GTT 


TGACGACCAG 


360 


GGGCGACTGC 


TGCTGPAAPA 


PPPTPPAPPP 


TP Zl Z\ 21 A 2\ TP Z\ 


LL 1 1 UUCAACa 


TGTGTGGACG 


420 


AACACCTGCT 


PPAPPPAPPP 


TPT APATPPP 


LAlaALLLLAL 


Al GAGGTGGA 


CCAACTAAGC 


480 




7A p c m a P Zi T 
ALtatjLALAu 1 


ALL 1 LLLLLA 


AAAGLTGlTG 


CCATCCGCAA 


GTTGGAGCAC 


540 


papptppppa 


1 ALL.At3L.ljL> A 


LLALL 1 LLLL 


GlaAGCGlGT 


TTCGCTTCCT 


CACGCGTTTG 


600 


PAPTAPTPTP 


LLLLLLjALL 1 


LLALLLLLL 1 


GCGACACAAT 


CAGCGCTCTG 


GGGCGAGCAC 


660 


GAPATPPAPT 






AALGTCACCT 


TGGCGCCCAA 


CCCTGACGAG 


720 


GTPPAPPAAP 


I LAova 1 ALL J. 


LALLLAALAL 


GAGlTGCGGC 


AGATGATGCA 


GCCGGACAAC 


780 


ywl IIjLAAI 


GGTCGCCGTG 


GTTTCGCATC 


ATCGCCGCGC 


GCTTCCTTGA 


GCGTTGGTGG 


840 


GCTGACCTGG 


ACGCGGCCCT 


AAACACTGAC 


AAACACGAGG 


ATTGGGGAAC 


GGTGCATCAC 


900 


ATCAACGAAG 


CGTGAAGGCA 


GAAGCTGCAG 


GATGTGAAGA 


CACGTCATGG 


GGTGGAATTG 


960 


CGTACTTGGC 


AGCTTCGTAT 


CTCCTTTTTC 


TGAGACTGAA 


CCTGCAGAGC 


TAGAGTCAAT 


1020 


GGTGCATCAT 


ATTCATCGTC 


1L1L1 i 1 1 L 1 


111 Ab AL 1 AA 


TCTGTAGCTA 


GAGTCACTGA 


1080 


TGAATCCTTT 


ACAACTTTCA 


A A A A A AZX&Zi 
ri/lAAAAAAA 








1109 


<210> 31 
<211> 985 
<212> DNA 

<213> Lactuca sativa 












<400> 31 
TGCCAAAATG 


TTGAAATTTC 


CCCCTTTTAA 


AACCATTGCT 


ACCATGATCT 


CTTCTCCATA 


60 


TTCTTCCTTC 


TTGCTGCCTC 


GGAAATCTTC 


TTTCCCTCCA 


ATGCCGTCTC 


TCGCAGCCGC 


120 


TAGTGTTTTC 


CTCCACCCTC 


TTTCGTCTGC 


CGCTATGGGC 


GATTCCAGCA 


TGGATGCTGT 


180 


CCAGCGACGT 


CTCATGTTCG 


ATGACGAATG 


CATTTTGGTG 


GATGAGAATG 


ACAAAGTGGT 


240 


TGGCCATGAT 


ACTAAATACA 


ATTGTCATTT 


GATGGAGAAG 


ATTGAAAAGG GAAATATGCT 


300 



IZ-v 
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ACACAGAGCA 


TTCAGTGTGT 


TCTTGTTCAA 


CTCGAAATAT 


GAATTACTCC 


TTCAGCAACG 


360 


TTCTGCAACC 


AAGGTGACTT 


TCCCTTTGGT 


ATGGACAAAC 


ACGTGTTGCA 


GCCATCCACT 


4 20 


ATACAGGGAG 


AGTGAGCTTA 


TTGACGAAAA 


CGCCCTTGGG 


GTGAGGAATG 


CTGCACAGAG 


4 80 


GAAGCTCCTG 


GATGAACTCG 


GCATCCCTGG' 


AGCAGATGTT 


CCGGTTGATG 


AGTTCACTCC 


54 0 


ATTGGGTCGC 


ATTCTATACA 


AGGCCGCATC 


GGATGGAAAG 


TGGGGAGAAC 


ATGAACTTGA 


600 


TTACCTGCTG 


TTTATGGTAC 


GTGATGTTGG 


TTTGGATCCG 


AACCCAGATG 


AAGTGAAAGA 


660 


TGTAAAATAT 


GTGAACCGGG 


AAGAGCTGAA 


GGAATTGGTA 


AGGAAGGCGG 


ATGCTGGTGA 


720 


AGAGGGTGTG 


AAGCTGTCCC 


CGTGGTTCAA 


ATTGATTGTC 


GATAATTTCT 


TGTTTCAGTG 


780 


GTGGGATCGA 


CTCCATAAGG 


GAACCCTAAC 


CGAAGCTATT 


GATATGAAAA 


CAATCCACAA 


84 0 


ACTCACATAA 


AAACACTACA 


CTAGTAGGAG 


AGAGGATTAT 


ATGAGATATT 


TGTTATATGT 


900 


GAAATTGAAA 


TTCAGATGAA 


TGCTTGTATT 


TATTTCTATT 


TGGACAAACT 


TCAACTTCTT 


960 


TTTGCTACCT 


TATCAGAAAA 


AAAAA 








985 


<210> 32 
<211> 988 
<212> DNA 

<213> Lactuca sativa 












<400> 32 
TATTCGCTTC 


AAAATCTCTT 


CCATTAACTG 


CTCAAATCTC 


CACCTTCGCC 


GGTCTTAATC 


60 


TCCGCCGGCG 


CACTTTCACC 


ACCATAACCG 


CCGCCATGGG 


TGACGATTCC 


GGCATGGACG 


120 


CTGTCCAGAG 


ACGTCTCATG 


TTTGATGATG 


AATGCATTTT 


GGTTGATGAA 


AATGACAATG 


180 


TTCTTGGGCA 


T GAT AC C AAA 


TACAATTGTC 


ACTTGATGGA 


GAAGATTGAG 


AAAGATAATT 


240 


TGCTTCATAG 


AGCATTCAGT 


GTATTTTTAT 


TCAATTCAAA 


ATACGAATTA 


ctccttcAgc 


300 


AAAGGTCAGA 


AACCAAGGTG 


ACATTTCCTT 


TGGTATGGAC 


AAACACCTGT 


TGCAGCCATC 


360 


CACTATACAG 


AGAATCGGAG 


TTAATTCCCG 


AAAATGCCCT 


TGGGGTCAGA 


AATGCTGCAC 


420 


AGAGGAAGCT 


TCTAGATGAA 


CTCGGTATCC 


CTGCTGAAGA 


TGTTCCAGTT 


GATGAGTTCA 


480 


CAACTTTAGG 


TCGCATGTTG 


TACAAGGCTC 


CATCTGATGG 


AAAATGGGGT 


GAACATGAAG 


540 


TTGATTACCT 


ACTCTTCCTC 


GTGCGTGACG 


TTGCCGTGAA 


CCCAAACCCT 


GATGAGGTGG 


600 


CGGACATTAG 


ATACGTGAAC 


CAAGAAGAGT 


TAAAAGAGTT 


ACTAAGGAAG 


GCGGATGCGG 


660 


GTGAGGAGGG 


TTTGAAATTG 


TCCCCATGGT 


TTAGGCTAGT 


GGTGGACAAC 


TTCTTGTTCA 


720 


AATGGTGGGA 


TCATGTCCAA 


AAGGGGACAC 


TCAATGAAGr 






Ton 


ATAAGTTGAT 


ATGAAAAATG 


GTTAATATTT 


ATGGTGGTGG 


TTTGGAGCTA 


ATAATTTGTG 


840 


TGTTCAAGTC 


TCGGTCCTTC 


TTTTTTTAAC 


GTTTTTTTTT 


TTTCTTTTAT 


TGGGAGTGTT 


900 


TATTGTGTAC 


TTGTAACGTA 


GGCCCTTTGG 


TTACGCTTTA 


AGAGTTTAAT 


AAAGAACCAC 


960 



33 
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CGTTAATTTA AAAAAAAAAA AAAAAAAA 

<210> 33 

<211> 1874 

<212> DNA 

<213> Chlamydomonas reinhardtii 

<400> 33 



GGCACGAGCT 


vjrio x X X \j l X 


TTAPPATPAP 




1 bbAAbL, lib 


TV TV f** m T\ m /"^ T\ 

AACTACCTCA 


60 


ATTACTCAAG 


TAACTCGCGG 


CAAPACATTT 


PPPPPPPP AT 


^LsL> l b 1111b 


1 b 1 bb 1 bCAG 


120 


CTACCGAGCA 


GCATTGCTTT 


AGATPGPTTT 


PATPTPATA A 


APTPPPZiPTT 


A 1 A 1 bAbATC 


18 0 


CAGTTTCATC 


GAGCCCAAGC 


PPAGAGPPPA 


APPTPTPTTA 


/\bbbbbbbbA 


bbbbb 1 bbAT 


24 0 


GCGCCTPGPG 


PAAAPPPPTP 


PTPTPPTTPP 


PPPTPTPTiPP 


ibbbbbblbl 


GGCCGGGAGC 


300 


AGGAPTTTPA 


PAPPPTPAAA 


OVw-O J. 1 V3^V30 1 


pppiv7atpppp 

uLurtrt I obbb 


Ab 1 1 bbl bAA 


CCTGGGAAGG 


360 


CAPGPPPPTft 


APPP APP ATP 




bbbbbAbbAb 


1 GbTTGGTGG 


TGGACGAGCA 


420 


ggappppptp 


1 /VooL,/t.L,L,Lj 


L,b/^rAb/i/\b 1 /A. 




CGCTTCGAGG 


CGGCCAAGGG 


480 






/vbbbbbbbl 1 




CTGTTCAGCC 


CCGACGGCCG 


540 


aptpptpptp 


p A p p a p p p rr 

V— fto Lnu ^ L. Li 




Lib i L1AUC3 11C 


CCGGGTGTGT 


GGACCAACAC 


600 




b/ibbbbb 1 bb 


bb bb bb Abbb 


GCCGGACGAG 


GTGGACCTGC 


CGGCGGCGGT 


660 




b-rtbb 1 bbbbb 


bbAl b/lAbbb 


c~* r*r* r**r* r* 
CjCjUvjLiCCjiLiI G 


CGCAAGCTGC 


AGCACGAGCT 


720 


vjooun 1 /AV^V—o 


bbbbAbbAbb 


I 1 bbbbbb 1 b 


L.1CC1 1LTCC 


TTCCTCACGC 


GTCTGCACTA 


780 


ptpppppppp 


ort^riLU L. K^r\ 


b b b rtb b Vjr b b b 


Lj LiCLibL. b la ACj 


TGGGGCGAGC 


ACGAGGTGGA 


840 


PTAPPTPPTft 


i. 1 L,Li IvjV^Ijo^ 


b b b/\b b/ib b b 


Lb 1 bAbbb 1 b 


CAGbbbAACC 


CAGACGAGGT 


900 


GGAPGPPAPft 


PPPTAPPTPA 


ppptpppppa 


PPTTPZIPTPP 
1 1 L-/4b 1 L»b 


Al bAl btjbbb 


ACbCCGGCCT 


960 


CAGCTGGAGP 




PPATPPTPPP 




bbb 1 1 bb 1 bb 


bbLibbTGGTG 


1020 


GGGCGACCTG 


AAGPftftPGPT 




PPPPAPPPPA 


PTPTPPP ZiPl 1 
b 1 b 1 bbb/ib 1 


bbbbbAbCA 1 


1 r\ 0 0 


CCACCGCGTP 


ATGTPAAPAA 


AAAGP.GP.AAft 


PAPPCPPPPP 


1\ PPPPPP P ZVT 


TV TV rp /""• TV T\ «T» 


1 1 4 n 
114 0 


GTGAATGCGA 


TTGTGATGCG 


GCGTGGGATG 


AGGTCTGAAG 


ACAGGGGGAA 


AATCGGGGGG 


1200 


CGGGCGTGAG 


CGTGTGTGTA 


CGTGAGCGAC 


AAAGCCGGGA 


GGCGGACCGC 


GCGATGGGTA 


1260 


CATGTGTGTG 


CGGAGGGTCG 


GTGGGTCGGT 


CGGTTGCGCG 


GCATAGCGTG 


TTGTGTGTGT 


1320 


GCGGCTGCAG 


GGGTATGTGG 


GCACCCGGGC 


ACGGAGGAGA 


AGGCACACGC 


AGGTGGCGCG 


1380 


GAGGTGTGTC 


AGGGGCCATG 


GGCGGGCCTC 


ACTCCTGGTC 


GTGCCCAGTG 


GTCTCGTGGG 


1440 


CAGAGTGGCA 


GGGGCTGCAC 


CCATATGAGC 


GGCGCACTGC 


CGCGCTGGGC 


TAAGTCCTTA 


1500 


TCACTTGGTG 


AGGTGGGGCG 


AGGTGGCTGT 


GGGCGGCGGG 


CGCAGTGGCA 


GAAGGACACG 


1560 


GTGTGTGAGC 


GGTGGAGCTC 


TGGCCGTGCC 


GGCCGTGAGG 


GGCGGATAGC 


GATATGACGT 


1620 



ss 
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TGTGPTTP.PP 


ppptptz* ziTr* 


L^tjtjJjM,loA-r\ 1 Lj 


1 (jCA(jbCCGC 


GAGAAGCGGG 


CGGTGGCAGG 


1680 




L 1 GCAGCACC 


CGTTGGGGAG 


GTGCCACCTG 


CAGGCGCGGC 


GCCGGGCGG'G 


1740 


CCT.GAGTAAT 


GGGCGCCTG'A 


GTAGTGGCGG 


CCACAGGAGG 


CGCAGGAGGC 


AGCAGCAGGA 


1800 


GGACGAGCTG 


GAGGGACCCG 


TTGGCAACCC 


AAGGTTGCGC 


GTGTAACATA 


GTGGCCATAC 


1860 


AAAAAAAAAA 


AAAA 










1874 


<210> 34 
<211> 954 
<212> DNA 

<213> Tagetes erecta 












<400> 34 
CCAAAAACAA 


CTCAAATCTC 


CTCCGTCGCT 


CTTACTPPPP 


P Tx TPP PTP z\ p 




60 


TGGATGCTGT 


TCAGCGACGT 


CTCATGTTTG 


ACGATGAATG 


v_/-iX i l 1 oo 1 *o 


TV T» >— « A /"* rp rp 

bAlbAbTbTG 


120 


ACAATGTGGT 


GGGACATGAT 


ACCAAATACA 


ATTGTCAPTT 

** X X VJ X \^.f\V— » J. X 


PZXTPpnpii np 


Al i b AAAbAG 


180 


GTAAAATGCT 


GCACAGAGCA 


TTCAGCGTTT 

x x vnuvjU xxx 


TTpT ATTP7AZ\ 
1 1 \* ± / i 1 i. v-./-\/-i 


1 x OAAAA 1 AC 


GAGTTACTTC 


240 


TTCAGCAACG 


GTCTGCAACC 


AAGGTGAPAT 


TTPPTTTZirT 


Al CjIjACCAAC 


ACCTGTTGCA 


300 


GCCATCCACT 


CTACAGAGAA 




1 1 \^^Co/tAAA 


CvjCCCTTGGA 


GTAAGAAATG 


360 


CTGCACAGAG 


GAAGCTGTTG 


n a t p a z\ r* t p p 




1 GAAGATGTT 


CCCGTTGATC 


420 


AGTTTACTCC 


TTTAGGTCGC 






TGATGGAAAG 


TGGGGAGAAC 


480 


ATGAACTTGA 


CTACCTACTT 


TTfATAHTnA 

x x \^r\ x no l 




Itj 1 AAACCCG 


AACCCAGATG 


540 


AAGTGGCGGA 


TATCAAATAT 


GTGACPAGAA 


P A T Z\ A Tl PP 


AloC I tjL. I AACa 


T\ TV TV ^ % TV y^* #v\ 

G AAAG C AG AT 


600 


GCGGGGGAGG 


AGGGTTTGAA 


PPTGTPTPPA 


I UV3 1 J. L^rt VJO 1 


J. AVj 1 QjLj i 1 bA 


TAACTTCTTG 


660 


TTCAAGTGGT 


GGGATCATGT 


GCAAAAGGGT 


nV/n^ 1 ^./Ai^, x v3 


T\ Tv TV TV ipmp TV 


rp 7» rp TV Tv 7v tv 

TATGAAAACC 


720 


ATACACAAGC 


TGATATAGAA 




AAL< U ij AAAAL) 


T*T>yT\ T\ r^r^r^m tv 

TTCAAGCCTA 


ATAATTCGGG 


780 


TTGGGTCGGG 


TCTACCATCA 


ATTGTTTTTT 


TCTTTTAAGA 


AGTTTTAATC 


TCTATTTGAG 


84 0 


CATGTTGATT 


CTTGTCTTTT 


GTGTGTAAGA 


TTTTGGGTTT 


CGTTTCAGTT 


GTAATAATGA 


900 


ACCATTGATG 


GTTTGCAATT 


TCAAGTTCCT 


ATCGACATGT 


AGTGATCTAA 


AAAA 


954 


<210> 35 

<211> 1031 

<212> DNA 

<213> Oryza sativa 












<400> 35 
CCTCCCTTTG 


CCTCGCGCAG 


AGGCGGCCGC 


GCCTTCTCCG 


CCGCGAGGAT 


GGCCGGCGCC 


60 


GCCGCCGCCG 


TGGAGGACGC 


CGGGATGGAC 


GAGGTCCAGA 


AGCGGCTCAT 


GTTCGACGAC 


120 


GAATGCATTT 


TGGTGGATGA 


ACAAGACAAT 


GTTGTTGGCC 


ATGAATCAAA 


ATATAACTGC 


180 


CATCTGATGG 


AAAAAATCGA 


ATCTGAAAAT 


CTACTTCATA 


GGGCTTTCAG 


TGTATTCCTG 


240 
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TTCAACTCAA AATATGAACT CCTACTCCAG CAACGATCTG CAACAAAGGT TACATTT CCT 300 

CTAGTTTGGA CCAACACTTG CTGCAGCCAT CCTCTGTACC GTGAGTCTGA GCTTATACAG 360 

GAAAACTACC TTGGTGTTAG AAATGCTGCT CAGAGGAAGC TCTTGGATGA GCTGGGCATC 420 

CCAGCTGAAG ATGTGCCAGT TGACCAATTC ACCCCTCTTG GTCGGATGCT TTACAAGGCC 4 80 

CCATCTGATG GAAAATGGGG TGAACACGAG CTTGACTACC TGCTGTTCAT CGTCCGCGAC 54 0 

GTGAAGGTAG TCCCGAACCC GGACGAAGTG GCCGATGTGA AATACG TGAG CCGTGAGCAG 600 

CTGAAGGAGC TCATCCGCAA AGCGGACGCC GGAGAGGAAG GCCTGAAGCT GTCTCCCTGG 660 

TTCCGGCTGG TTGTTGACAA CTTCCTCATG GGCTGGTGGG ATCACGTCGA GAAAGGCACC 720 

CTCAACGAGG CCGTGGACAT GGAGACCATC CACAAGCTGA AGTAAGGACT GCGATGTTGT 780 

GGCTGGAAAG AATGATCCTG AAGACTCTGT TCTTGTGCTG CTGCATATTA CTCTTACCAG 84 0 

GGAAGTTGCA GAAGTCAGAA GAAGCTTTTG TATGTTTCTG GGTTTGGAGC TTGGAAGTGT 900 

TGGGCTCTGC TGACTGAGAG ATTCCCTTAT AGAGTGTCTA TGTTAATTTA GCAAACTTCT 960 

ATATTATACA TGATTAGTTA ATTGTTCGGT GTCTGAATAA AGAACAATAG CATGTTCCAT 1020 

GTTTATTTGC T 1031 

<210> 36 

<211> 232 

<212> PRT 

<213> Tagetes erecta 

<400> 36 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gin Arg Arg Leu Met Phe 
1 5 10 15 

Asp Asp Glu Cys lie Leu Val Asp Glu Cys Asp Asn Val Val Gly His' 
20 25 30. 

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Thr Gly Lys 
35 40 45 

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu 
50 55 60 

Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val 
65 70 " 75 80 

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu 
85 90 ' 95 

Val Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu 
100 105 HO 

Leu Asp Glu Leu Gly He Pro Ala Glu Asp Val Pro Val Asp Gin Phe 
115 120 125 

Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp 
130 135 140 



St 
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Gly Glu His Glu Leu Asp Tyr Leu Leu Phe He Val Arg Asp Val Ala 

145 150 155 " 160 

Val Asn Pro Asn Pro Asp Glu Val Ala Asp He Lys Tyr Val Ser His 

165 1 170 " ' 175 

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly 
180 185 190 

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe 
195 200 205 

Lys Trp Trp Asp His Val Gin Lys Gly Thr Leu Thr Glu Ala He Asp ' 
210 215 220 



220 

Met Lys Thr He His Lys Leu He 
225 230 

<210> 37 
<211> 280 
<212> PRT 

<213> Lactuca Sativa 
<400> 37 

Met Leu Lys Phe Pro Pro Phe Lys Thr He Ala Thr Met lie Ser Ser 
15 10 15 

Pro Tyr Ser Ser Phe Leu Leu 'Pro Arg Lys Ser Ser Phe Pro Pro Met 
20 25 30 

Pro Ser Leu Ala Ala Ala Ser Val Phe Leu His Pro Leu Ser Ser Ala 
35 40 45 

Ala Met Gly Asp Ser Ser Met Asp Ala Val Gin Arg Arg Leu Met Phe 
50 55 60 

Asp Asp Glu Cys He Leu Val Asp Glu Asn Asp Lys Val Val Gly His 
65 70 75 80 

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Lys Gly Asn 
85 90 95 

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu 
100 105 HO 

Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val 
115 120 125 

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu 
130 135 140 

He Asp Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu 
145 150 155 160 

Leu Asp Glu Leu Gly He Pro Gly Ala Asp Val Pro Val Asp Glu Phe 
165 170 175 

Thr Pro Leu Gly Arg He Leu Tyr Lys Ala Ala Ser Asp Gly Lys Trp 
180 185 190 

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Met Val Arg Asp Val Gly 
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195 200 205 

Leu Asp Pro Asn Pro Asp Glu Val Lys Asp Val Lys Tyr Val Asn Arq 
210 215 220 

Glu Glu Leu Lys Glu Leu Val Arg Lys Ala Asp Ala Gly Glu Glu Gly 
225 230 235 ' 240 

Val Lys Leu Ser Pro Trp Phe Lys Leu He Val Asp Asn Phe Leu Phe 
245 250 255 

Gin Trp Trp Asp Arg Leu His Lys Gly Thr Leu Thr Glu Ala He Asp 
260 265 • 270 

Met Lys Thr He His Lys Leu Thr 
275 280 

<210> 38 
<211> 229 
<212> PRT 

<213> Lactuca Sativa 
<400> 38 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gin Arg Arg Leu Met Phe 
1 5 10 is 

Asp Asp Glu Cys He Leu Val Asp Glu Asn Asp Asn Val Leu Gly His 
20 25 30 

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Lys Asp Asn 
35 40 45 

Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu 
50 55 60 

Leu Leu Leu Gin Gin Arg Ser Glu Thr Lys Val Thr Phe Pro Leu Val 
65 70 75 80 

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu- Ser Glu Leu 
85 90 95 

He Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu 
100 105 HO 

Leu Asp Glu Leu Gly lie Pro Ala Glu Asp Val Pro Val Asp Glu Phe 
115 120 125 

Thr Thr Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys TrD 
130 135 140 

Gly Glu His Glu Val Asp Tyr Leu Leu Phe Leu Val Arg Asp Val Ala 
145 150 155 ~ 160 

Val Asn Pro Asn Pro Asp Glu Val Ala Asp He Arg Tyr Val Asn Gin 
165 170 175 

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly 
180 185 190 

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe 
195 200 205 



BNSDOCID: <WO 9963055A1_1_> 



WO 99/63055 PCT/US99/12121 

Lys Trp Trp Asp His Val Gin Lys Gly Thr Leu Asn Glu Ala lie Asp 
210 215 220 

Met Lys Thr He His 
225 

<210> 39 
<211> 295 
<212> PRT 

<213> Adonis Palaestina 
<400> 39 

Met Ser Ser He Arg He Asn Pro Leu Tyr Ser He Phe Ser Thr Thr ■ 
15 10 15 

Thr Lys Thr Leu Ser Ala Ser Cys Ser Ser Pro Ala Val His Leu Gin 
20 25 30 

Gin Arg Cys Arg Thr Leu Ser He Ser. Ser Ser He Thr Asn Ser Pro 
35 40 45 

Arg Arg Gly Leu Asn Arg Leu Phe Ala Ser Thr Ser Thr Met Gly Glu 
50 55 60 

Val Ala Asp Ala Gly Met Asp Ala Val Gin Lys Arg Leu Met Phe Asp 
65 70 75 80 

Asp Glu Cys He Leu Val Asp Glu Asn Asp Lys Val Val Gly Tyr Asp 
85 90 95 

Ser Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Ala Glu Asn Leu 
100 105 110 

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu 
115 120 125 

Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val Trp 
130 135 140 

Thr Asn Thr Cys Cys Ser- His Pro Leu Phe Arg Asp Ser Glu Leu He 
145 150 155 160 

Glu Glu Asn Phe Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu Leu 
165 170 175 

Asp Glu Leu Gly He Pro Ala Glu Asp Val Pro Val Asp Glu Phe Thr 
180 185 190 

Pro Leu Gly Arg He Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly 
195 200 205 

Glu His Glu Leu Asp Tyr Leu Leu Phe He Val Arg Asp Val Lys Tyr 
210 215 220 

Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val Asn Arg Glu 
225 230 235 240 

Glu Leu Lys Glu He Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly He 
245 250 • 255 

Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe Lys 

3*1 
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260 265 270 

Trp Trp Asp His Val Glu Glu Gly Lys lie Lys Asp Val Ala Asp Met 
275 280 285 

Lys Thr lie His Lys Leu Thr 
290 295 

<210> 40 
<211> 234 
<212> PRT 

<213> Adonis Palaestina 
<400> 40 

Met Gly Glu Val Thr Asp Ala Gly Met Asp Ala Val Gin Lys Arg Leu 
1 5 10 15 

Met Phe Asp Asp Glu Cys lie Leu Val Asp Glu Asn Asp Lys Val Val 
20 25 30 

Gly His Asp Ser Lys Tyr Asn Cys His Leu Met Glu Lys lie Glu Ala 
35 40 45 

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys 
50 55 60 

Tyr Glu Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro 
65 70 " 75 80 

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser 
85 90 95 

Glu Leu lie Glu Glu Asn Tyr Leu Gly Val Arg Asn Ala Ala Gin Arg 
100 105 110 

Lys Leu Leu Asp Glu Leu Gly lie Pro Ala Glu Asp Val Pro Val Asp 
115 120 125 

Glu Phe Thr Pro Leu Gly Arg. lie Leu Tyr Lys Ala Pro Ser Asp Gly 
130 135 140 

Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe lie Val Arg Asp 
145 150 155 160 

Val Lys Tyr Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val 
165 170 175 

Asn Arg Glu Glu Leu Arg Glu lie Leu Arg Lys Ala Asp Ala Gly Glu 
180 185 190 

Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe 
195 200 205 

Leu Phe Lys Trp Trp Asp His Val Glu Gin Gly Thr lie Lys Glu Val 
210 215 220 

Ala Asp Met Lys Thr lie His Lys Leu Thr 
225 230 

<210> 41 
<211> 238 
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<212> PRT 
<213> Oryza Sativa 

<400> 41 

Met Ala Gly Ala Ala Ala Ala Val Glu Asp Ala Gly Met Asp Glu Val 
1 5 10 15 

Gin Lys Arg Leu Met Phe Asp Asp Glu Cys He Leu Val Asp Glu Gin 
20 25 30 

Asp Asn Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu Met Glu 
35 4 0 4 5 

Lys He Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu 
50 55 60 

Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys 
65 70 75 / 80 

Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu 
85 90 95 

Tyr Arg Glu Ser Glu Leu He Gin Glu Asn Tyr Leu Gly Val Arg Asn 
100 105 HO 

Ala Ala Gin Arg Lys Leu Leu Asp Glu Leu Gly He Pro Ala Glu Asp 
115 120 125 

Val Pro Val Asp Gin Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala 
130 135 " 140 

Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe 
i4 . 5 150 155 ' 160 

He Val Arg Asp Val Lys Val Val Pro Asn Pro Asp Glu Val Ala Asp 
165 170 175 

Val Lys Tyr Val Ser Arg Glu Gin Leu Lys Glu Leu He Arg Lys Ala 
180 185 190 

Asp Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val 
195 200 205 

Val Asp Asn Phe Leu Met Gly Trp Trp Asp His Val Glu Lys Gly Thr 
210 215 - 220 

Leu Asn Glu Ala Val Asp Met Glu Thr He His Lys Leu Lys 
225 230 235 

<210> 42 
<211> 233 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Met Thr Asp Ser Asn Asp Ala Gly Met Asp Ala Val Gin Arg Arg Leu 
1 5 10 15 

Met Phe Glu Asp Glu Cys He Leu Val Asp Glu Asn Asn Arg Val Val 
20 25 30 
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Gly His Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys lie Glu Ala 
35 40 45 

Glu Asn Leu Leu His ,Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys 
50 55 60 

Tyr Glu Leu Leu Leu Gin Gin Arg Ser Lys Thr Lys Val Thr Phe Pro 
65 70 75 80 

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser 
85 90 95 

Glu Leu lie Glu Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gin Arg 
100 105 110 

Lys Leu Phe Asp Glu Leu Gly lie Val Ala Glu Asp Val Pro Val Asp 
115 120 125 

Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly 
130 135 140 

Lys Trp Gly Glu His Glu Val Asp Tyr Leu Leu Phe lie Val Arg Asp 
145 150 155 160 

Val Lys Leu Gin Pro Asn Pro Asp Glu Val Ala Glu lie Lys Tyr Val 
165 170 175 

Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp 
180 185 190 

Glu Ala Val Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe 
195 200 205 

Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr lie Thr Glu Ala 
210 215 220 

Ala Asp Met Lys Thr lie His Lys Leu 
225 230 

<210> '43 
<211> 293 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 43 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His lie Pro Arg Val Asn 
1 5 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30 

Arg Ser Met Gin Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly 
35 40 " 45 

Ala Ser Thr Trp Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys 
50 55 60 

Asp Glu Cys lie Leu Val Asp Val Glu Asp Asn lie Thr Gly His Ala 
65 70 " 75 ~ 80 

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu 
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85 90 95 

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arq Leu- 
100 105 110 

Leu Leu Gin Gin Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp 
115 120 125 

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu 
130 135 140 

Val Asp Gin Leu Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lvs 
145 150 155 Y i£ 0 . 

Ala Ala Ala He Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His 
165 170 - 175 

Gin Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cvs 
180 185 190 

Ala Ala Asp Val Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Glv Glu 
195 200 205 

His Glu Met Asp Tyr He Leu Phe He Arg Ala Asn Val Thr Leu Ala 
210 215 220 

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu 
225 230 235 240 

Leu Arg Gin Met Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp 
245 250 255 

Phe Arg He He Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu 
260 265 * 270 

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His 
275 280 285 

His He Asn Glu Ala 
290 

<210> 44 
<211> 304 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 44 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His He Pro Arq Val Asn 
1 5 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30 

Arg Ser Met Gin Met Thr Leu Met Gin Pro Ser He Ser Ala Asn Leu 
35 40 45 

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 
50 55 60 

Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys Asp Glu Cys He 
65 70 75 80 



43 
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Leu 



Val Asp Val 



Glu Asp Asn 

. 85 



lie Thr Gly His Ala Ser Lys Leu Glu 
90 95 



His Lys Phe 
100 



Leu Pro His 



Pro Ala Gly Leu Leu His Arg Ala Phe 
105 110 



Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu Leu Leu Gin Gin Arg 
115 120 125 

Ala Arg Ser Lys lie Thr Phe Pro Ser Val Trp Thr Asn Thr Cys Cys 
130 135 140 

Ser His Pro Leu His Gly Gin Thr Pro Asp Glu Val Asp Gin Leu Ser 
145 150 155 160 

Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala lie Arg 
165 170 175 

Lys Leu Glu His Glu Leu Gly He Pro Ala His Gin Leu Pro Ala Ser 
180 185 190 

Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val Gin 
195 200 205 

Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu His Glu Met Asp Tyr 
210 215 220 

He Leu Phe He Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp Glu 
225 230 235 240 

Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu Leu Arg Gin Met Met 
245 250 255 

Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp Phe Arg He lie Ala 
260 265 270 

Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu Asn 
275 280 285 

Thr Asp Lys His Glu Asp Trp Gly Thr Val His His He Asn Glu Ala 
290 295 300 

<210> 45 

<211> 307 

<212> PRT 

<213> Chlamydomonas reinhardtii 

<400> 45 

Met Arg Ser Ser Phe He Glu Pro Lys Pro Arg Ala Gin Pro Val Leu 
1 5 10 15 

Ser Arg Gly Arg Ala Ser Met Arg Leu Ala Gin Ser Arg Ala Leu Val 
20 25 30 

Ala Arg Val Ser Ser Ala Leu Trp Pro Gly Ala Gly Leu Ser Gin Ala 
35 40 " 45 

Qln Ser Val Ala Val Arg Met Ala Ser Ser Ser Thr Trp Glu Gly Thr 
50 55 60 

Gly Leu Ser Gin Asp Asp Phe Met Gin Arg Asp Glu Cys Leu Val Val 
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65 70 .75 "BO 

Asp Glu Gin Asp Arg Leu Leu Gly Thr Ala Asn Lys Tyr Asp Cys His 
85 90 95 

Arg Phe Glu Ala Ala Lys Gly Gin Pro Cys Gly Arg Leu His Arg Ala 
100 105 " HO 

Phe Ser Val Phe Leu Phe Ser Pro Asp Gly Arg Leu Leu Leu Gin Gin 
115 120 125 

Arg Ala Ala Ser Lys Val Thr Phe Pro Gly Val Trp Thr Asn Thr Cys 
130 135 140 

■> 

Cys Ser His Pro Leu Ala Gly Gin Ala Pro Asp Glu Val Asp Leu Pro 
145 150 155 160 

Ala Ala Val Ala Ser Gly Gin Val Pro Gly lie Lys Ala Ala Ala Val 
165 170 " 175 

Arg Lys Leu Gin His Glu Leu Gly lie Pro Pro Glu Gin Val Pro Ala 
180 185 190 

Ser Ser Phe Ser Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Thr 
195 200 205 

Ala Thr His Gly Pro Ala Ala Glu Trp Gly Glu His Glu Val Asp Tyr 
210 215 220 

Val Leu Phe Val Arg Pro Gin Gin Pro Val Ser Leu Gin Pro Asn Pro 
225 230 235 240 

Asp Glu Val Asp Ala Thr Arg Tyr Val Thr Leu Pro Glu Leu Gin Ser 
245 250 255 

Met Met Ala Asp Pro Gly Leu Ser Trp Ser Pro Trp Phe Arg lie Leu 
260 265 270 

Ala Thr Gin Pro Ala Phe Leu Pro Ala Trp Trp Gly Asp Leu Lys Arg 
275 280 285 

Arg Trp Arg Pro Gly Gly Ser Arg Leu Ser Asp Trp Gly Thr lie His 
290 295 300 

Arg Val Met 
305 

<210> 46 
<211> 1848 
<212> DNA 

<213> Adonis palaestina 
<400> 46 

GAGAGAAAAA GAGTGTTATA TTAATGTTAC TGTCGCATTC TTGCAACACA TAT TC AG ACT 60 
CCATTTTCTT GTTTTCTCTT CAAAACAACA AACTAATGTG ACGGAGTATC TAGCTATGGA 120 
ACTACTTGGT GTTCGCAACC TCATCTCTTC TTGCCCTGTC TGGACTTTTG GAACAAGAAA 180 
CCTTAGTAGT TCAAAACTAG CTTATAACAT ACATCGATAT GGTTCTTCTT G TAG AG TAG A 24 0 



M2T 
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TTTTPAAGTP 




P""T/"* , P > A A P*P"»P"*r* 
b> 1 GbjAAbjbbjG 


GAGTAGAACT 


TCTGTTGCTT 


ATAAAGAGGG 


300 


TTTTRTnnar 

*■ A X ± O 1 OOrtb 


rarrarraTT 

bj/ibjbj/Vbjbj/i 1 I 


1 lAAbAAAtjb 


TGGTGGTTCT 


GAGCTTTTGT 


TTGTCCAAAT 


360 


GPAGPA aara 

o rtVj ^ rV/"Vrib/\ 


AAo ItlAl bjG 


AGAAAbAGGb 


CAAGCTCGCC 


GATAAGTTGC 


CACCAATACC 


420 


TTTPPP APA A 


ILlbl b>Al bjbj 


TV ftl fr» #T> /"> <ti 


AATAGGTTGT 


GGACCTGCTG 


GTCTTTCACT 


4 80 


GGPTGPAGAA 


p P t p p t z\ a pp 


1 A.bjbjb 1 i LjAA 


AGTTGGCCTT 


ATTGGTCCTG 


ATCTTCCTTT 


540 


TACAAATAAT 


TATGGTGTGT 


(^nnaa^arra 

oounnbnbbn 


P r P r P/"* A A 7\ <^ 7\ T" 

«1 J. LAAAbA I 


CTTGGACTTG 


AACGTTGTAT 


600 


CGAGCATGCT 


TGGAAGGAPA 


PP ATPPT AT A 


Ibl 1 bj/\UAA. 1 


GATGCTCCTG 


TCCTTATTGG 


660 


TCGTGCATAT 


GGAPPAPTTA 


PPPPPP A TTT 


PPT 1 A T 1 f* A A 

bb 1 VjCA 1 b»AA 


GAGTTGCTGA 


AAAGGTGTGT 


720 


CGAGTCAGGT 


GT ATP AT ATP 
w a ,f\ a v^rt A a b 


tp a aTTPTa a 




tv tv A""«m/""» tv t\ 

ATCACTGAAG 


CTGGTGATGG 


780 


CCATAGTCTT 


GTAGTTTPTP 

O A nu A A 1 vj 1 u 


a a a a pp n p a t 

Hrt/irtbbnbn 1 


bi i lAi ccc r 


TGCAGGCTTG 


CTACTGTTGC 


840 


ATCTGGAGCA 


PPTTPAPPPA 


./-V/-ib 1111 Vjbj/i 


L» 1 A 1 bAAGTA 


GGTGGCCCTC 


GTGTTTGTGT 


900 


CCAAACTGCT 

Vi» J*-*x*i X VJw X 


TATPPTPTPP 


.riOO 1 1 o/VVJO 1 


P/~* A/^AA/^A ATI 
obj AbjAAC AA I 


/"^ TV m TV tv m 

CCATACGATC 


CCAACTTAAT 


960 


GGTATTTATG 

X XX XXX *i X \J 


GAPTAPAGAG 


APTATATPPA 




CAGTGCTCGG 


AAGAAGAATA 


1020 


TCCAACATTT 

x vnnvn x x x 


PTPTATPTPA 


TPPPPATPTP 
x bbLbn i bib 


rrp A A P* A A/~* a 


CTTTTTTTTG 


AGGAAACCTG 


1080 


TTTGGCPTPA 


AAAPATPPPA 


1 oLL 111 v-o/\ 


ibl Ab I bAAG 


AG AAAAC T AA 


TGTCACGATT 


1140 


GAAGAPTPTP 


Inl bb/^rt.0 


1 1 ftb/WWi 1 


rri rp tv rpo tv t\ tv r> 

1 1AI L>AAG AG 


GAATGGTCTT 


ATATTCCTGT 


1200 


TGGP.GP.TTPT 


x iy*bbAAA.b/\ 


UAbjALibAAAA 


GAACCTAGCA 


TTTGGTGCTG 


CAGCAAGCAT 


1260 


P P T P P A T rr A 

OVJ J. o^-rt 1 bb.ri 


bjbAAbAbjbjb 1 


ATTCGGTTGT 


ACGATCACTA 


TCAGAAGCTC 


CAAAATATGC 


1320 


T T P TP T A A T T 


CC7\ A A P' A <?T«? 
bjbAAAbjAl 1 1 


TCaAAGCAAGA 


TAACTCTGCA 


TATGTGGTTT 


CTGGACAAAG 


1380 


CAG TGP AG T A 


AAPATTTP&A 
nn^n III b/\M 


I bbft/ibbA X bj 


bjAGCAGTCTT 


TGGCCAAAGG 


AGCGAAAACG 


1440 


TCAAAGAGPA 


rp rp r*~i r 1 1 rp rp r—*rp rp rp 
i-lblllblll 


1 bbVjbj 1 1 /Abj/A 


l*^P > T ,r P A rprp /"tit /~» 

bjb i 1A11 GTG 


O t\ r^ m tv tv rrv tv 

CAG C TAG AT A 


TTGAAGCAAC 


1500 


CAGAACGTTC 


TTTAGAAPPT 
AAA nunriv«^ x 


1 V- 1 1 bbbb 1 1 


PPP A a PTTrr 
bbbAAb 1 1 brbj 


T\ rr» /^» rri f~* rr\ 

AT<jTGGTGGG 


GTTTCCTTGG 


1560 


GTCTTCACTA 


TP ATPTTTPP 


aTPT TP TBTT 
nlbl lv5l/ll 1 


olill bbA 1 G 


TACATGTTTG 


TTTTGGCCCC 


1620 


GAACAGCATG 


AGGATGTCAC 


TTGTGAGACA 


TTTGCTTTCA 


GATCCTTCTG 


GTGCAGTTAT 


1680 


GGTTAAAGCT 


TACCTCGAAA 


GGTAATCTGT 


TTTATGAAAC 


TATAGTGTCT 


CAT T AAAT AA 


1740 


ATGAGGATCC 


TTCGTATATG 


TATATGATCA 


TCTCTATGTA 


TATCCTATAT 


TCTAATCTCA 


1800 


TAAAGTAATC 


GAAAATTCAT 


TGATAGAAAA 


AAAAAAAAAA 


AAAAAAAA 




1848 



<210> 47 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 47 

Met Glu Leu Leu Gly Val Arg Asn Leu lie Ser Ser Cys Pro Val Trp 
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1 "5 10 15 

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn lie ' 
20 25 ' 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 
35 40 45 ^ 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe lie Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 
65 70 75 80 ■ 

Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
85 90 95 

Lys Leu Pro Pro He Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
100 105 , HO 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 160 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asp Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 

Leu Val Val Cys Glu Asn Glu He Phe lie Pro Cys Arg Leu Ala Thr 
225 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
24 5 250 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 



V 
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Lys Leu Met Ser "Arg Leu Lys Thr Leu Gly lie Gin Val Thr Lys Val 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 390 395 400 

Tyr Ala Ser Val He Ala Lys He Leu Lys Gin Asp Asn Ser Ala Tyr 
405 410 " 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn He Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu He Val Gin Leu Asp lie Glu Ala Thr Arg Thr 
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 ' 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 ~ 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu 
515 520 525 

Arg 

<210> 48 
<211> 378 
<212> PRT 
<213> Potato 

<400> 48 

Asp Glu Phe Lys Asp Leu Gly Leu Gin Ala Cys He Glu His Val Trp 
15 10 15 

Arg Asp Thr He Val Tyr Leu Asp Asp Asp Asp Pro He Leu He Gly 
20 25 30 

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu 
35 40 45 

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp 
50 55 60 

Arg He Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly 
65 70 75 80 

Asp Val Val He Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala 

85 90 95 
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Ser Gly Lys Phe Leu Gin Tyr Glu Leu Gly Gly Pro Arg Val Ser Val 
100 105 110 

Gin Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp 
115 ' 120 125 

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp 
130 135 140 

Ala Gin Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro 
145 150 155 * ~ 160 

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys 1 
165 170 175 

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu 
180 185 190 

Asn Thr Leu Gly Val Arg lie Lys Glu , lie Tyr Glu Glu Glu Trp Ser 
195 200 205 

Tyr lie Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gin Lys Thr Leu 
210 215 220 

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser 
225 230 235 240 

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala 
245 250 255 

Asn lie Leu Arg Gin Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser 
260 265 270 

Thr Pro Ser lie Ser Thr Gin Ala Trp Asn Thr Leu Trp Pro Gin Glu 
275 280 285 

Arg Lys Arg Gin Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu lie Leu 
290 295 300 

Gin Leu Asp lie Glu Gly lie Arg Ser Phe Phe Arg Ala Phe Phe Arg 
305 . 310 315 320 

Val Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Ser Leu Ser Xaa 
325 330 335 

Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe He He Ala Pro Asn 
340 345 350 

Asp Met Arg Arg Gly Leu He Arg His Leu Leu Ser Asp Pro Thr Gly 
355 360 365 

Ala Thr Leu He Arg Thr Tyr Leu Thr Phe 
370 375 

<210> 49 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 49 
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Met 
1 



Glu Cys Val 



Gly Ala Arg Asn Phe Ala 
5 10 



Ala Met 



Ala Val 



Ser Thr 
15 



Phe 



Pro Ser Trp 
20 



Ser Cys Arg Arg Lys Phe 
25 



Pro Val 



Val Lys 
30 



Arg Tyr 



Ser Tyr Arg Asn lie Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly 
35 40 45 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu He Leu Phe 
65 70 75 80 

Val Gin Met Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val 
85 90 J 95 

Asp Lys Leu Pro Pro He Ser He Gly Asp Gly Ala Leu Asp His Val 
100 105 no 

Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 120 125 

Lys Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin 
145 150 155 J 160 

Lys Cys He Glu His Val Trp Arg Glu Thr He Val Tyr Leu Asp Asp 
165 170 175 

Asp Lys Pro He Thr He Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 ^ 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 " 205 

Tyr Leu Ser Ser Lys Val Asp Ser He Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val He Pro Cys Arg Leu Ala 
225 230 235 240 

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu 
260 265 ~ 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
3 °5 310 315 320 

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
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325 330 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly lie Arg He Leu Lys 
340 345 350 

Thr Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro 
355 360 365 

Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 , 400 

Lys Tyr Ala Ser Val He Ala Glu He Leu Arg Glu Glu Thr Thr Lys 
405 410 " 415 

Gin lie Asn Ser Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro 
420 425' 430 

Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 445 

He Val Gin Phe Asp Thr Glu Gly He Arg Ser Phe Phe Arg Thr Phe 
450 455 460 

Phe Arg Leu Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val He Ser 
485 490 495 

Pro Asn Asn Leu Arg Lys Gly Leu lie Asn His Leu lie Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met He Lys Thr Tyr Leu Lys Val 
515 520 

<210> 50 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 50 

Met Glu Leu Leu Gly Val Arg Asn Leu lie Ser Ser Cys Pro Val Trp 
1 5 10 - 15 

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn He 
20 25 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 
35 40 45 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe lie Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 
65 70 75 80 

Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
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85 90 95 

Lys Leu Pro Pro lie Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
100 , 105 no 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arq . 

150 155 ~ 160 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asp Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 

Leu Val Val Cys Glu Asn Glu He Phe He Pro Cys Arg Leu Ala Thr 
225 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 " 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val * 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tvr Ara 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
2 90 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys Val 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 390 395 400 

Tyr Ala Ser Val He Ala Lys He Leu Lys Gin Asp Asn Ser Ala Tyr 
405 410 415 
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Val Val Ser Gly Gin Ser Ser Ala Val Asn lie Ser Met Gin Ala Tip 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Argdn Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu He Val Gin Leu Asp He Glu Ala Thr Ara Thr 
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Glv Phe 
465 470 475 y 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val' Met Val Arg Ala Tyr Leu Glu 
• 515 520 .525 

Arg 



<210> 51 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 51 

Met Glu Leu Leu Gly Val Arg Ash Leu He Ser Ser Cys Pro Val Trp 
1 .5 .10 is ■ 

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn lie 
20 25 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 
35 40 45 

Gly Gly Ser Gly Ser Arg Thr Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 * 60 

Asp Glu Glu Asp Phe He Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 
65 70 75 80 

Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
85 90 95 

Lys Leu Pro Pro He Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
100 105 110 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Ara 
145 150 155 160 
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Cys lie Glu His "Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He .Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 " 205 

Leu Asn Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 

Leu Val Val Cys Glu Asn Asp He Phe He Pro Cys Arg Leu Ala Thr 
225 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arq 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys He 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 390 395 400 

Tyr Ala Ser Val He Ala Lys lie Leu Lys Gin Asp Asn Ser Ala Tyr 
405 410 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn He Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu He Val Gin Leu Asp lie Glu Ala Thr Arg Thr 
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp .Gly Phe 
465 470 475 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
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"4 85 4 90 4 95""" 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Lys Ala Tyr Leu Glu 
515 520 1 525 

Arg 



<210> 52 

<211> 533 

<212> PRT 

<213> Lettuce 

<400> 52 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe 
1 5 10 15 

Thr Cys Pro Arg Phe Thr Asp Cys Asn lie Arg His Lys Phe Ser Leu 

, 20 25 30 

Leu Lys Gin Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arq 
35 40 45 

Gin He Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gin 
50. 55 60 

Gly He Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Glv Glv Ser 
65 70 " 75 80 

Glu Leu Phe Phe Val Gin Met Gin Arg Thr Lys Ser Met Glu Ser Gin 
85 90 95 

Ser Lys Leu Ser Glu Lys Leu Ala Gin He Pro He Gly Asn Cys He 
100 105 no 

Leu Asp Leu Val Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala 
115 120 125 

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu Ue Gly Pro Asp 
130 135 140 

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gin Asp Glu Phe He Glv 
145 150 155 160 

Leu Gly Leu Glu Gly Cys He Glu His Ser Trp Lys Asp Thr Leu Val 
165 170 175 

Tyr Leu Asp Asp Ala Asp Pro He Arg He Gly Arg Ala Tyr Gly Arg 
180 185 " 190 

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu 
195 200 205 

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg He Thr Glu Ala 
210 215 220 

Pro Asn Gly Tyr Ser Leu lie Glu Cys Glu Gly Asn He Thr He Pro 
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225 230 235 240 

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu 
245, 250 255 

Glu Tyr Glu Leu Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly 
260 265 270 

lie Glu Val Glu Val Glu Asn Asn Pro Tyr Asp Pro Asp Leu Met Val 
275 280 285 

Phe Met Asp Tyr Arg Asp Phe Ser Lys His Lys Pro Glu Ser Leu Glu . 
290 295 300 

Ala Lys Tyr Pro Thr Phe Leu Tyr Val Met Ala Met Ser Pro Thr Lys 
305 310 315 320 

lie Phe Phe Glu Glu Thr Cys Leu Ala Ser Arg Glu Ala Met Pro Phe 
325 330 335 

Asn Leu Leu Lys Ser Lys Leu Met Ser Arg Leu Lys Ala Met Gly lie 
340 345 " 350 

Arg lie Thr Arg Thr Tyr Glu Glu Glu Trp Ser Tyr lie Pro Val Gly 
355 360 365 

Gly Ser Leu Pro Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala 
370 375 380 

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu 
385 390 395 400 

Ser Glu Ala Pro Ash Tyr Ala Ala Val lie Ala Lys lie Leu Arg Gin 
405 410 415 

Asp Gin Ser Lys Glu Met lie Ser Leu Gly Lys Tyr Thr Asn lie Ser 
420 425 " 430 

Lys Gin Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gin Arg 
435 440 445 

Ala Phe Phe Leu Phe Gly Leu Ser His lie Val Leu Met Asp Leu Glu 
450 455 460 

Gly Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp Met 
465 470 475 480 

Trp Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu lie lie 
4 85 4 90 4 95 

Phe Ala Leu Tyr Met Phe Val lie Ala Pro His Ser Leu Arg Met Glu 
500 505 510 

Leu Val Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Met Val Lys 
515 520 525 

Ala Tyr Leu Thr lie 
530 

<210> 53 

SX 
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<211> 526 
<212> PRT 
<213> Tomato 

<40.0> 53 

Met Glu Cys Val Gly Val Gin Asn Val Gly Ala Met Ala Val Leu Thr 
1 5 * 10 15 

Arg Pro Arg Leu Asn Arg Trp Ser Gly Gly Glu Leu Cys Gin Glu Lys 
20 25 30 

Ser lie Phe Leu Ala Tyr Glu Gin Tyr Glu Ser Lys Cys Asn Ser Ser 
35 40 45 

Ser Gly Ser Asp Ser Cys Val Val Asp Lys Glu Asp Phe Ala Asp Glu 
50 55 60 

Glu Asp Tyr He Lys Ala Gly Gly Ser Gin Leu Val Phe Val Gin Met 
65 70 75 80 

Gin Gin Lys Lys Asp Met Asp Gin Gin Ser Lys Leu Ser Asp Glu Leu 
85 90 95 

Arg Gin He Ser Ala Gly Gin Thr Val Leu Asp Leu Val Val He Gly 
100 105 110 

Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly 
115 120 125 

Leu Asn Val Gly Leu Val Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr 
130 135 140 

Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Gin Ala Cys He 
145 150 155 160 

Glu His Val Trp Arg Asp Thr lie Val Tyr Leu Asp Asp Asp Glu Pro 
165 170 175 

He Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Phe Leu His 
180 185 190 

Glu Glu Leu Leu Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn 
195 200 " 205 

Ser Lys Val Asp Arg He Val Glu Ala Thr Asn Gly Gin Ser Leu Val 
210 215 220 

Glu Cys Glu Gly Asp Val Val He Pro Cys Arg Phe Val Thr Val Ala 
225 230 235 240 

Ser Gly Ala Ala Ser Gly Lys Phe Leu Gin Tyr Glu Leu Gly Ser Pro 
245 250 255 

Arg Val Ser Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn 
260 265 270 

Asn Pro Phe Asp Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr 
275 280 285 

Leu Arg His Asp Ala Gin Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu 
290 295 300 



r? 
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Tyr Ala Met Pro" Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys 
305 310 315 320 

Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu 
325 330 335 

Met Leu Arg Leu Asn Thr Leu Gly Val Arg lie Lys Glu lie Tyr Glu 
340 345 ' 350 

Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn Thr Glu 
355 360 365 

Gin Lys Thr Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala 
370 375 380 

Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala 
385 390 395 400 

Ser Val Leu Ala Asn He Leu Arg Gin His Tyr Ser Lys Asn Met Leu 
405 410 415 

Thr Ser Ser Ser He Pro Ser He Ser Thr Gin Ala Trp Asn Thr Leu 
420 425 430 

Trp Pro Gin Glu Arg Lys Arg Gin Arg Ser Phe Phe Leu Phe Gly Leu 
435 440 445 

Ala Leu He Leu Gin Leu Asp He Glu Gly He Arg Ser Phe Phe Arg 
450 455 460 

Ala Phe Phe Arg Val Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser 
465 470 475 480 

Ser Leu Ser Ser Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe He 
485 490 ' 495 

He Ala Pro Asn Asp Met Arg Lys Gly Leu He Arg His Leu Leu Ser 
500 505 510 

Asp Pro Thr Gly Ala Thr Leu He Arg Thr Tyr Leu Thr Phe 
515 520 525 

<210> 54 
<211> 516 
<212> PRT 

<213> Tagetes erecta 
<400> 54 

Met Ser Met Arg Ala Gly His Met Thr Ala Thr Met Ala Ala Phe Thr 
15 10 15 

Cys Pro Arg Phe Met Thr Ser He Arg Tyr Thr Lys Gin He Lys Cys 
20 25 ~ 30 

Asn Ala Ala Lys Ser Gin Leu Val Val Lys Gin Glu He Glu Glu Glu 
35 40 45 

Glu Asp Tyr Val Lys Ala Gly Gly Ser Glu Leu Leu Phe Val Gin Met 
50 55 60 
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Gin Gin Asn Lys Ser Met Asp Ala Gin Ser Ser Leu Ser Gin Lys Leu 
65 70 75 ~ 80 

Pro Arg Val Pro lie Gly Gly Gly Gly Asp Ser Asn Cys lie Leu Asp 
8 5 90 95 

Leu Val Val lie Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Gly Glu 
100 105 110 

Ser Ala Lys Leu Gly Leu Asn Val Ala Leu He Gly Pro Asp Leu Pro 
115 120 125 

Phe Thr Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe He Gly Leu Gly 
130 ' 135 140 

Leu Glu Gly Cys He Glu His Val Trp Arg Asp Thr Val Val Tyr Leu 
145 150 155 160 

Asp Asp Asn Asp Pro He Leu lie Gly' Arg Ala Tyr Gly Arg Val Ser 
165 170 "* 175 

Arg Asp Leu Leu His Glu Glu Leu Leu Thr Arg Cys Met Glu Ser Gly 
180 185 190 

Val Ser Tyr Leu Ser Ser Lys Val Glu Arg He Thr Glu Ala Pro Asn 
195 200 205 

Gly Leu Ser Leu He Glu Cys Glu Gly Asn He Thr He Pro Cys Arg 
210 215 220 

Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr 
225 230 235 240 

Glu Leu Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly He Glu 
245 250 255 

Val Glu Val Glu Ser He Pro Tyr Asp Pro Ser Leu Met Val Phe Met 
260 265 270 

Asp Tyr Arg Asp Tyr Thr Lys His Lys Ser Gin Ser Leu Glu Ala Gin 
275 280 285 

Tyr Pro Thr Phe Leu Tyr Val Met Pro Met Ser Pro Thr Lys Val Phe 
290 295 300 

Phe Glu Glu Thr Cys Leu Ala Ser Lys Glu Ala Met Pro Phe Glu Leu 
305 310 315 320 

Leu Lys Thr Lys Leu Met Ser Arg Leu Lys Thr Met Gly He Arg He 
325 " 330 335 

Thr Lys Thr Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser 
340 345 350 

Leu Pro Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser 
355 360 365 

Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu 
370 375 380 

Ala Pro Asn Tyr Ala Ala Val lie Ala Lys He Leu Gly Lys Gly Asn 
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385 



390 



395 



4~00 



Ser Lys Gin 



Met 



Leu 
405 



Asp His Gly Arg Tyr Thr Thr Asn lie Ser Lys 
410 415 



Gin Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gin Arg Ala 
420 425 430 

Phe Phe Leu Phe Gly Leu Ala Leu He Val Gin Met Asp He Glu <51y 
435 440 445 

Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Tro 
450 455 460 

Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu He He Phe 

470 475 480 

Ala Phe Tyr Met Phe lie lie Ala Pro His Ser Leu Arg Met Gly Leu 
485 490 495 

Val Arg His Leu Leu Ser Asp Pro Thr Gly Gly Thr Met Leu Lys Ala 
500 505 510 

Tyr Leu Thr He 



<210> 55 
<211> 501 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 55 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Asp Phe Phe He Pro 
1 5 10 15 

Gin Phe His Gly Phe Glu Arg Leu Cys Ser Asn Asn Pro Tyr His Ser 
20 25 30 

Arg Val Arg Leu Gly Val Lys Lys Arg Ala lie Lys He Val Ser Ser 
35 40 45 

Val Val Ser Gly Ser Ala Ala Leu Leu Asp Leu Val Pro Glu Thr Lys 
50 55 60 

Lys Glu Asn Leu Asp Phe Glu Leu Pro Leu Tyr Asp Thr Ser Lys Ser 
65 70 75 * 80 

Gin Val Val Asp Leu Ala He Val Gly Gly Gly Pro Ala Gly Leu Ala 
85 90 95 

Val Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Cys Ser lie Asp 
100 105 no 

Pro Ser Pro Lys Leu He Trp Pro Asn Asn Tyr Gly Val Trp Val Asp 
115 120 125 

Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr Thr Trp Ser 
130 135 ' 140 

Gly Ala Val Val Tyr Val Asp Glu Gly Val Lys Lys Asp Leu Ser Arg 



515 
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1^5 150 155 160 

Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Leu Gin ' 
165 170 " 175 

Lys Cys lie Thr Asn Gly Val Lys Phe His Gin Ser Lys Val Thr Asn 
180 185 190 

Val Val His Glu Glu Ala Asn Ser Thr Val Val Cys Ser Asp Gly Val 
195 200 205 

Lys lie Gin Ala Ser Val Val Leu Asp Ala Thr Gly Phe Ser Arg Cys 
210 215 220 

Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin Val Ala Tyr 
225 230 235 240 

Gly He Val Ala Glu Val Asp Gly His Pro Phe Asp Val Asp Lys Met 
245 250 * 255 

Val Phe Met Asp Trp Arg Asp Lys His Leu Asp Ser Tyr Pro Glu Leu 
260 265 270 

Lys Glu Arg Asn Ser Lys He Pro Thr Phe Leu Tyr Ala Met Pro Phe 
275 280 285 

Ser Ser Asn Arg He Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro 
290 295 300 

Gly Leu Arg Met Glu Asp He Gin Glu Arg Met Ala Ala Arg Leu Lys 
305 310 315 320 

His Leu Gly He Asn Val Lys Arg He Glu Glu Asp Glu Arg Cys Val 
325 330 335 

He Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val Gly 
340 345 350 

He Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val 
355 360 365 

Ala Arg Thr Leu Ala Ala Ala Pro He Val Ala Asn Ala He Val Arg 
370 375 380 

Tyr Leu Gly Ser Pro Ser Ser Asn Ser Leu Arg Gly Asp Gin Leu Ser 
385 390 395 ~ * 400 

Ala Glu Val Trp Arg Asp Leu Trp Pro He Glu Arg Arg Arg Gin Arg 
4 05. 410 - 

Glu Phe Phe Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp Leu Asp 
420 425 ' 430 

Ala Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Gin Pro His Tyr 
435 440 445 

Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu Val 
450 455 460 

Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu 
465 470 475 480 
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lie Met Thr Lys Gly Thr Val Pro Leu Ala Lys Met lie Asn Asn Leu 
485 490 495 

Val Gin Asp Arg Asp 
500 

<210> 56 
<211> 502 
<212> PRT 

<213> Adonis palaestina 
<400> 56 

Met Asp Thr Leu Leu Arg Thr His Asn Lys Leu Glu Leu Leu Pro Thr 
1 5 10 15 

Leu His Gly Phe Ala Glu Lys Gin His Leu Val Ser Thr Ser Lys Leu 
20 25- 30 

Gin Asn Gin Val Phe Arg lie Ala Ser Arg Asn lie His Pro Cys Arg 
35 40 45 

Asn Gly Thr Val Lys Ala Arg Gly Ser Ala Leu Leu Glu Leu Val Pro 
50 55 60 

Glu Thr Lys Lys Glu Asn Leu Glu Phe Asp Leu Pro Ala Tyr Asp Pro 
65 70 75 80 

Ser Arg Gly lie Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala 
85 90 95 

Gly Leu Ala lie Ala Gin Gin Val Ser Glu Ala Gly Leu Leu Val Cys 
100 105 110 

Ser lie Asp Pro Ser Pro Lys Leu lie Trp Pro Asn Asn Tyr Gly Val 
115 120 125 

Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr 
130 135 140 

Thr Trp Ser Gly Ala Val Val Tyr Thr Asp Asp Asn Ser Lys Lys Tyr 
145 150 155 " 160 

Leu Asp Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys 
165 170 175 

Met Leu Gin Lys Cys Val Thr Asn Gly Val Lys Phe His Gin Ala Lys 
180 185 190 

Val lie Lys Val lie His Glu Glu Ser Lys Ser Leu Leu lie Cys Asn 
195 200 205 

Asp Gly lie Thr lie Asn Ala Thr Val Val Leu Asp Ala Thr Gly Phe 
210 215 220 

Ser Arg Cys Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin 
225 230 235 240 

Val Ala Tyr Gly lie Met Ala Glu Val Glu Glu His Pro Phe Asp Leu 
245 250 255 
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Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser His Leu Asn Glu Lys 
260 265 270 

Leu Glu Leu Lys Asp Lys Asn Arg Lys lie Pro Thr Phe Leu Tyr Ala 
275 280 285 

Met Pro Phe Ser Ser Thr Lys lie Phe Leu Glu Glu Thr Ser Leu Val 
290 295 300 

Ala Arg Pro Gly Leu Arg Phe Glu Asp lie Gin Glu Arg Met Val Ala 
305 310 315 320 

Arg Leu Lys His Leu Gly lie Lys Val Lys Ser lie Glu Glu Asp Glu 
325 330 335 

Arg Cys Val lie Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Arg 
34 0 34 5 350 

Val Val Gly lie Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly 
355 360 c 365 

Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Lys Ser 
370 375 380 

He Val Gin Tyr Leu Gly Ser Asp Arg Ser Leu Ser Gly Asn Glu Leu 
385 390 395 400 

Ser Ala Glu Val Trp Lys Asp Leu Trp Pro He Glu Arg Arg Arg Gin 
405 410 " 415 

Arg Glu Phe Phe Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp Leu 
420 ' 425 430 

Gin Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro His 
435 440 445 

Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu 
450 455 460 

Phe Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Ala Ser Arg He 
465 470 475 480 

Glu He Met Ala Lys Gly Thr Val Pro Leu Val Asn Met Met Asn Asn 
485 490 495 

Leu He Gin Asp Thr Asp 
500 



<210> 57 
<211> 498 
<212> PRT 
<213> Pepper 

<400> 57 

Met Asp Thr Leu Leu Arg Thr Pro Asn Asn Leu Glu Phe Leu His Gly 
1 5 10 15 

Phe Gly Val Lys Val Ser Ala Phe Ser Ser Val Lys Ser Gin Lys Phe 
20 25 * 30 
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Gly Ala Lys Lys Phe Cys Glu Gly Leu Gly Ser Arg Ser Val Cys Val 
35 40 45 

Lys Ala Ser Ser Ser, Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys 
50 55 60 

Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val 
65 70 75 " 80 

Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val 
8 5 90 95 

Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Cys Ser lie Asp Pro 
100 105 HO 

Asn Pro Lys Leu lie Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu 
115 120 ~ 125 

Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly 
130 135 140 

Ala Ala Val Tyr lie Asp Asp Lys Thr Thr Lys Asp Leu Asn Arg Pro 
145 150 155 160 

Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Met Gin Lys 
165 170 " 175 

Cys lie Leu Asn Gly Val Lys Phe His Gin Ala Lys Val lie Lys Val 
180 185 190 

lie His Glu Glu Ser Lys Ser Met Leu lie Cys Asn Asp Gly lie Thr 
195 200 205 

lie Gin Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg Ser Leu 
210 215 220 

Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin Val Ala Tyr Gly 
225 230 235 240 

lie Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys Met Val 
245 250 255 

Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Val Glu Leu Lys 
260 265 270 

Glu Arg Asn Ser Arg lie Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser 
275 280 285 

Ser Asn Arg lie Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly 
290 295 300 

Leu Gly Met Asp Asp lie Gin Glu Arg Met Val Ala Arg Leu Ser His 
305 310 315 320 

Leu Gly lie Lys Val Lys Ser lie Glu Glu Asp Glu His Cys Val lie 
325 330 335 

Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val Gly lie 
34 0 34 5 350 

Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala 
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355 360 365 

Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala lie He Gin Tyr 
37 0 375 , 380 

Leu Ser Ser Glu Arg Ser His Ser Gly Asp Glu Leu Ser Ala Ala Val 
385 390 395 400 

Trp Lys Asp Leu Trp Pro He Glu Arg Arg Arg Gin Arg Glu Phe Phe 
4 05 410 415 

Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp Leu Pro Ala Thr Arq 
^20 425 430 

Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly 
435 440 445 

Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu He Val Phe Gly Leu 
450 455 1 460 

Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu He Met Thr 
465 470 475 480 

Lys Gly Thr Leu Pro Leu Val His Met lie Asn Asn Leu Leu Gin Asp 
485 490 495 

Lys Glu 



<210> 58 
<211> 500 
<212> PRT 
<213> Tomato 

<400> 58 

Met Asp Thr Leu Leu Lys Thr Pro Asn Asn Leu Glu Phe Leu Asn Pro 
15 10 15 

His His Gly Phe Ala Val Lys Ala Ser Thr Phe Arg Ser Glu Lys His 
20 25 ~ 30 

His Asn Phe Gly Ser Arg Lys Phe Cys Glu Thr Leu Gly Arg Ser Val 
35 40 45 

Cys Val Lys Gly Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr 
50 55 60 

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys 
65 70 75 * 80 

Gly Val Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu 
85 90 95 

Ala Val Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Cys Ser He 
100 105 HO 

Asp Pro Asn Pro Lys Leu He Trp Pro Asn Asn Tyr Gly Val Trp Val 
US 120 125 

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp 
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130 135 14 0 

Ser Gly Ala Ala Val Tyr He Asp Asp Asn Thr Ala Lys Asp Leu His 
145 ,150 155 " 160 

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Met 
165 170 175 

Gin Lys Cys He Met Asn Gly Val Lys Phe His Gin Ala Lys Val He 
180 185 190 

Lys Val He His Glu Glu Ser Lys Ser Met Leu He Cys Asn Asp Gly 
195 200 205 

He Thr He Gin Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg 
210 215 220 

Ser Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin Val Ala 
225 230 235 ^ ' 240 

Tyr Gly lie Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys 
245 250 * 255 

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Thr Asp 
260 265 270 

Leu Lys Glu Arg Asn Ser Arg lie Pro Thr Phe Leu Tyr Ala Met Pro 
275 280 285 

Phe Ser Ser Asn Arg He Phe Leu Glu Glu Thr Ser Leu Val Ala Arg 
290 295 300 

Pro Gly Leu Arg lie Asp Asp lie Gin Glu Arg Met Val Ala Arg Leu 
305 310 315 320 

Asn His Leu Gly lie Lys Val Lys Ser lie Glu Glu Asp Glu His Cys 
325 330 * 335 

Leu lie Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val 
340 345 350 

Gly lie Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met 
355 360 365 

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala He He 
370 375 380 

Gin Tyr Leu Gly Ser Glu Arg Ser His Ser Gly Asn Glu Leu Ser Thr 
385 390 395 400 

Ala Val Trp Lys Asp Leu Trp Pro lie Glu Arg Arg Arg Gin Arg Glu 
405 410 415 

Phe Phe Cys Phe Gly Met Asp lie Leu Leu Lys Leu Asp Leu Pro Ala 
4 20 4 25 4 30 

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp 
435 440 445 

His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu lie Val Phe 
450 455 460 
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Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Phe Glu lie 
465 470 475 " 480 

. Met Thr Lys Gly Thr Val Pro LeuVal Asn Met lie Asn Asn Leu Leu 

485 490 495 

Gin Asp Lys Glu 
500 



<210> 59 
<211> 500 
<212> PRT 
<213> Tobacco 

<400> 59 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Glu Phe Leu His Pro 
15 10 15 

Val His Gly Phe Ser Val Lys Ala Ser Ser Phe Asn Ser Val Lys Pro 
20 25 30 

His Lys Phe Gly Ser Arg Lys lie Cys Glu Asn Trp Gly Lys Gly Val 
35 4 0 4 5 

Cys Val Lys Ala Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr 
50 55 60 

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys 
65 70 75 80 

Gly Leu Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu 
85 90 -95 

Ala Val Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Val Ser lie 
100 105 110 

Asp Pro Ser Pro Lys Leu lie Trp Pro Asn Asn Tyr Gly Val Trp Val 
115 120 " 125 

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp 
130 135 14 0 

Ser Gly Thr Val Val Tyr lie Asp Asp Asn Thr Thr Lys Asp Leu Asp 
145 150 ^ 155 160 

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Met 
165 170 175 

Gin Lys Cys lie Leu Asn Gly Val Lys Phe His His Ala Lys Val lie 
180 185 190 

Lys Val lie His Glu Glu Ala Lys Ser Met Leu lie Cys Asn Asp Gly 
195 200 205 

Val Thr lie Gin Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg 
210 215 220 

Cys Leu Val Gin Tyr Asp Lys Pro Tyr Lys Pro Gly Tyr Gin Val Ala 
225 230 235 240 
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Tyr Gly He Leu Ala Glu Val Glu Glu His Pro Phe Asp Thr Ser Lys 
245 250 255 . 

Met Val Leu Met Asp, Trp Arg Asp Ser His Leu Gly Asn Asn Met Glu 
260 265 270 

Leu Lys Glu Arg Asn Arg Lys Val Pro Thr Phe Leu Tyr Ala Met Pro 
275 280 285 

Phe Ser Ser Asn Lys He Phe Leu Glu Glu Thr Ser Leu Val Ala Ara 
290 295 300 

Pro Gly Leu Arg Met Asp Asp He Gin Glu Arg Met Val Ala Arq Leu 
305 310 315 ^ 320 

Asn His Leu Gly He Lys Val Lys Ser He Glu Glu Asp Glu His Cys 
325 330 335 

Val He Pro Met Gly Gly Ser Leu Pro Val lie Pro Gin Arg Val Val 
340 345 350 

Gly Thr Gly Gly Thr Ala Gly Leu Val His Pro Ser Thr Gly Tyr Met 
355 360 365 

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala He He 
370 375 380 

His Tyr Leu Gly Ser Glu Lys Asp Leu Leu Gly Asn Glu Leu Ser Ala 
385 390 395 4 00 

Ala Val Trp Lys Asp Leu Trp Pro lie Glu Arg Arg Arg Gin Arg Glu 
405 410 " 415 

Phe Phe Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp Leu Pro Ala 
420 425 430 

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp 
435 440 * 445 

His Gly Phe Leu Ser Ser Arg Leu Tyr Leu Pro Glu Leu He Phe Phe 
450 455 460 

Gly Leu Ser Leu Phe Ser Arg Ala Ser Asn Thr Ser Arg lie Glu He 
465 470 475 " 480 

Met Thr Lys Gly Thr Leu Pro Leu Val Asn Met lie Asn Asn Leu Leu 
485 490 495 

Gin Asp Thr Glu 
500 



<210> 60 
<211> 511 
<212> PRT 

<213> Tagetes erecta 



<400> 60 

Met Asp Thr Phe Leu Arg Thr Tyr Asn Ser Phe Glu Phe Val His Pro 
15 10 15 
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Ser Asn Lys Phe Ala Gly Asn Leu Asn Asn Leu Asn Gin Leu Asn Gin 
20 25 30 

Ser Lys Ser Gin Phe Gin Asp Phe Arg Phe Gly Pro Lys Lys Ser Gin 
35 4 0 4 5 

Phe Lys Leu Gly Gin Lys Tyr Cys Val Lys Ala Ser Ser Ser Ala Leu 
50 55 60 

Leu Glu Leu Val Pro Glu lie Lys Lys Glu Asn Leu Asp Phe Asp Leu 
65 70 75 80 

Pro Met Tyr Asp Pro Ser Arg Asn Val Val Val Asp Leu Val Val Val 
85 90 95 

Gly Gly Gly Pro Ser Gly Leu Ala Val Ala Gin Gin Val Ser Glu Ala ' 
100 105 110 

Gly Leu Thr Val Cys Ser lie Asp Pro !Ser Pro Lys Leu lie Trp Pro 
115 120 . 125 

Asn Asn Tyr Gly Val Trp Val Asp Glu Phe ,Glu Ala Met Asp Leu Leu 
130 135 140 

Asp Cys Leu Asp Thr Thr Trp Ser Ser Ala Val Val Tyr lie Asp Glu 
145 150 155 " 160 

Lys Ser Thr Lys Ser Leu Asn Arg Pro Tyr Ala Arg Val Asn Arg Lys 
165 170 175 

Gin Leu Lys Thr Lys Met Leu Gin Lys Cys lie Ala Asn Gly Val Lys 
180 ' 185 190 

Phe His Gin Ala Lys Val lie Lys Val lie His Glu Glu Leu Lys Ser 
195 200 205 

Leu Leu He Cys Asn Asp Gly Val Thr He Gin Ala Thr Leu Val Leu 
210 215 220 

♦ 

Asp Ala Thr Gly Phe Ser Arg Ser Leu Val Gin Tyr Asp Lys Pro Tyr 
225 230 235 240 

Asn Pro Gly Tyr Gin Val Ala Tyr Gly He Leu Ala Glu Val Glu Glu 
245 250 255 

His Pro Phe Asp Val Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser 
260 265 270 

His Leu Asp Gin Asn Leu Glu He Lys Ala Arg Asn Ser Arg He Pro 
275 280 285 

Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser Thr Arg He Phe Leu Glu 
290 295 300 

Glu Thr Ser Leu Val Ala Arg Pro Gly Leu Lys Met Glu Asp He Gin 
305 310 315 " 320 

Glu Arg Met Ala Tyr Arg Leu Lys His Leu Gly He Lys Val Lys Ser 
325 330 " 335 

He Glu Glu Asp Glu Arg Cys Val He Pro Met Gly Gly Pro Leu Pro 



Us 
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34 0 34 5 350 

Val Leu Pro Gin Arg Val Leu Gly lie Gly Gly Thr Ala Gly Met Val 
355 . 360 365 

His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro 
370 375 380 

lie Val Ala Lys Ser lie lie Arg Tyr Leu Asn Asn Glu Lys Ser Met 
385 390 395 400 

Val Ala Asp Val Thr Gly Asp Asp Leu Ala Ala Gly lie Trp Arg Glu . 

405 410 415 

Leu Trp Pro lie Glu Arg Arg Arg Gin Arg Glu Phe Phe Cys Phe Gly 
420 425 430 

Met Asp lie Leu Leu Lys Leu Asp Leu Glu Gly Thr Arg Arg Phe Phe 
435 440 445 

Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser 
450 455 460 

Ser Arg Leu Phe Leu Pro Glu Leu Val Thr Phe Gly Leu Ser Leu Phe 
465 470 475 * 480 

Gly His Ala Ser Asn Thr Cys Arg Val Glu lie Met Ala Lys Gly Thr 
485 490 495 

Leu Pro Leu Ala Thr Met lie Gly Asn Leu Val Arg Asp Arg Glu 
500 505 510 

<210> 61 
<211> 503 
<212> PRT 
<213> Daffodil 

<400> 61 

Met Asp Thr Leu Leu Arg Thr His Asn Arg Leu Glu Leu Leu Tyr Pro 
15 10 15 

Leu His Glu Leu Ala Lys Arg His Phe Leu Ser Pro Ser Pro Asn Pro 
20 25 30 

Gin Asn Pro Asn Phe Lys Phe Phe Ser Arg Lys Pro Tyr Gin Lys Lys 
35 40 45 

Cys Arg Asn Gly Tyr lie Gly Val Ser Ser Asn Gin Leu Leu Asp Leu 
50 55 60 

Val Pro Glu lie Lys Lys Glu His Leu Glu Phe Asp Leu Pro Leu Tyr 
65 70 75 80 

Asp Pro Ser Lys Ala Leu Thr Leu Asp Leu Ala Val Val Gly Gly Gly 

85 90 95 

Pro Leu Ala Arg Ser Cys Ser Thr Ser Leu Gly Gly Gly Leu Ser Val 
100 105 110 

Val Ser lie Asp Pro Asn Pro Lys Leu lie Trp Pro Asn Asn Tyr Gly 
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115 120 125 

Val Trp Val Asp Glu Phe Glu Asp Met Asp Leu Leu Asp Cys Leu Asp 
130 135 140 

Ala Thr Trp Ser Gly Ala lie Val Tyr Val Asp Asp Arg Ser Thr Lys 
145 . 150 155 ~ 160 

Asn Leu Ser Arg Pro Tyr Ala Arg Val Asn Arg Lys Asn Leu Lys Ser 
165 170 ~ 175 

Lys Met Met Lys Lys. Cys Val Ser Asn Gly Val Arg Phe His Gin Ala 
180 185 190 

Thr Val Val Lys Ala Met His Glu Glu Glu Lys Ser Tyr Leu lie Cys 
195 200 205 

Ser Asp Gly Val Thr lie Asp Ala Arg Val Val Leu Asp Ala Thr Gly 
210 215 ' 220 

Phe Ser Arg Cys Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr 
225 230 235 240 

Gin Val Ala Tyr Gly lie Leu Ala Glu Val Glu Glu His Pro Phe Asp 
245 250 255 

Val Asp Lys Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Gly 
260 265 270 

Lys Ala Glu Leu Asn Glu Arg Asn Ala Lys lie Pro Thr Phe Leu Tyr 
275 280 285 

Ala Met Pro- Phe Ser Ser Asn Arg lie Phe Leu Glu Glu Thr Ser Leu 
290 295 300 

Val Ala Arg Pro Gly Leu Lys Met Glu Asp lie Gin Glu Arg Met Val 
305 310 " 315 320 

Ala Arg Leu Asn His Leu Gly lie Arg lie Lys Ser lie Glu Glu Asp' 
325 330 335 

Glu Arg Cys Val lie Pro Met Gly Gly Pro Leu Pro Val lie Pro Gin 
340 345 350 

Arg Val Val Gly He Gly Gly Thr Ala Gly Met Val His Pro Ser Thr 
355 360 365 

Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro He Val Ala Asn 
370 375 380 

Ser He Val Gin Tyr Leu Val Ser Asp Ser Gly Leu Ser Gly Asn Asp 
385 390 395 "* 400 

Leu Ser Ala Asp Val Trp Lys Asp Leu Trp Pro He Glu Arg Arg Arg 
405 410 415 

Gin Arg Glu Phe Phe Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp 
420 " 425 * 430 

Leu Glu Gly Thr Arg Arg . Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro 
4 35 44 0 44 5 
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Arg Tyr Trp His' Gly Phe Leu Ser 
450 455 

Val Pro Phe Gly Leu Ser Leu Phe 
465 470 

Leu Glu lie Met Ala Lys Gly Thr 
485 

Asn Leu Val Gin Asp Arg Asp 
500 



Ser Arg Leu Phe Leu Pro Glu Leu 
4 60 



Ser His Ala Ser Asn Thr Cys Lys 

475 480 

Leu Pro Leu Val Asn Met lie Asn 

490 495 
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GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM 
AND METHODS OF USE THEREOF 

BACKGROUND OF THE INVENTION 
Field of the Invention 

5 The present invention describes nucleic acid sequences for eukaryotic genes encoding 

€ lycopene e-cyclase (also known as e-cyclase and e lycopene cyclase), isopentenyl 
pyrophosphate isomerase (IPP) and p -carotene hydroxylase as well as vectors containing the 
same and hosts transformed with said vectors. The present invention also provides methods 
for augmenting the accumulation of carotenoids, changing the composition of the 

10 carotenoids, and producing novel and rare carotenoids. The present invention provides 

methods for controlling the ratio or relative amounts of various carotenoids in a host. The 
invention also relates to modified lycopene e-cyclase, IPP isomerase and p -carotene 
hydroxylase. Additionally, the present invention provides a method for screening for genes 
and cDNAs encoding enzymes of carotenoid biosynthesis and metabolism. 

15 Backgroun d of the Invention 

Carotenoid pigments with cyclic endgroups are essential components of the 
photosynthetic apparatus in oxygenic photosynthetic organisms {e.g., cyanobacteria, algae 
and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment p- 
carotene (or, in rare cases, the asymmetrical bicyclic a -carotene) is intimately associated with 

20 the photosynthetic reaction centers and plays a vital role in protecting against potentially 
lethal photooxidative damage (Koyama, 1991). P -carotene and other carotenoids derived 
from it or from a -carotene also serve as light-harvesting pigments (Siefermann-Harms, 
1987), are involved in the thermal dissipation of excess light energy captured by the light- 
harvesting antenna (Demmig- Adams & Adams, 1992), provide substrate for the biosynthesis 

25 of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991; Parry & Horgan, 1991), 
and are precursors of vitamin A in human and animal diets (Krinsky, 1987). Plants also 
exploit carotenoids as coloring agents in flowers and fruits to attract pollinators and agents of 
seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of agronomic 
value in a number of important crops. Carotenoids are currently harvested from a variety of 

30 organisms, including plants, algae, yeasts, cyanobacteria and bacteria, for use as pigments in 
food and feed. 
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The probable pathway for formation of cyclic carotenoids in plants,.algae and 
cyanobacteria is illustrated in Figure 1. Two types of cyclic endgroups or rings are 
commonly found in higher plant carotenoids, these are referred to as the p (beta) and e 
(epsilon) rings (Fig. 3). The precursor acyclic endgroup (no ring structure) is referred to as 
5 the Y ipsi) endgroup. The P and e endgroups differ only in the position of the double bond in 
the ring. Carotenoids with two P rings are ubiquitous, and those with one p and one e ring 
are common, but carotenoids with two e rings are uncommon, p-carotene (Fig. 1) has two P- 
endgroups and is a symmetrical compound that is the precursor of a number of other 
important plant carotenoids such as zeaxanthin and violaxanthin (Fig. 2). 
I o Genes encoding enzymes of carotenoid biosynthesis have previously been isolated 

from a variety of sources including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216, 
254-268; Misawa et al., 1990, J. Bacterid., 172, 6704-12), fungi (Schmidhauser et al., 1990, 
Mol. Cell. Biol. 10, 5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch, 45c, 
482-86; Cunningham et al., 1994) and higher plants (Bartley et al., Proc. Natl. Acad. Sci USA 
15 88, 6532-36; Martinez-Ferez & Vioque, 1992, Plant Mol. Biol. 18, 981-83). Many of the 
isolated enzymes show a great diversity in structure, function and inhibitory properties 
between sources. For example, phytoene desaturases from the cyanobacterium 
Synechococcus and from higher plants and green algae carry out a two-step desaturation to 
yield C-carotene as a reaction product. In plants and cyanobacteria a second enzyme (C- 
20 carotene desaturase), similar in amino acid sequence to the phytoene desaturase, catalyzes 

two additional desaturations to yield lycopene. In contrast, a single desaturase enzyme from 
Erwinia herbicola and from other bacteria introduces all four double bonds required to form 
lycopene. The Erwinia and other bacterial desaturases bear little amino acid sequence 
similarity to the plant and cyanobacterial desaturase enzymes, and are thought to be of 
25 unrelated ancestry. Therefore, even with a gene in hand from one source, it may be difficult to 
identify a gene encoding an enzyme of similar function in another organism. In particular, 
the sequence similarity between certain of the prokaryotic and eukaryotic genes encoding 
enzymes of carotenoid biosynthesis is quite low. 

Further, the mechanism of gene expression in prokaryotes and eukaryotes appears to 
30 differ sufficiently such that one cannot expect that an isolated eukaryotic gene will be 
properly expressed in a prokaryotic host. 
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The difficulties in isolating genes encoding enzymes with similar functions is 
exemplified by recent efforts to isolate the gene encoding the enzyme that catalyzes the 
formation of p -carotene from the acyclic precursor lycopene. Although a gene encoding an 
enzyme with this function had been isolated from a bacterium, it had not been isolated from 
5 any photosynthetic procaryote or from any eukaryotic organism. The isolation and 

characterization of the enzyme catalyzing formation of p -carotene in the cyanobacterium 
Synechococcus PCC7942 was described by the present inventors and others (Cunningham et 
al., 1993 and 1994). The amino acid sequence similarity of the cyanobacterial enzyme to the 
various bacterial lycopene p-cyclases is so low (ca. 18-25% overall; Cunningham et al., 
1 0 1 994) that there is much uncertainty as to whether they share a common ancestry or, instead, 
represent an example of convergent evolution. 

The need remains for the isolation of eukaryotic and prokaryotic genes and cDNAs 
encoding polypeptides involved in the carotenoid biosynthetic pathway, including those 
encoding a lycopene e-cyclase, IPP isomerase and p -carotene hydroxylase. There remains a 
1 5 need for methods to enhance the production of carotenoids, to alter the composition of 

carotenoids, and to reduce or eliminate carotenoid production. There also remains a need in 
the art for methods for screening for genes and cDNAs encoding enzymes of carotenoid 
biosynthesis and metabolism. 

SITMMARV O F TFTF TNVFNTTON 
20 Accordingly, a first object of this invention is to provide purified and/or isolated 

nucleic acids which encode enzymes involved in carotenoid biosynthesis; in particular, 
lycopene e-cyclase , IPP isomerase and p -carotene hydroxylase. 

A second object of this invention is to provide purified and/or isolated nucleic acids 
which encode enzymes which produce novel or uncommon carotenoids. 
25 A third object of the present invention is to provide vectors containing said genes. 

A fourth object of the present invention is to provide hosts transformed with said 
vectors. 

Another object of the present invention is to provide hosts which accumulate novel or 
uncommon carotenoids or which accumulate greater amounts of specific or total carotenoids. 
30 Another object of the present invention is to provide hosts with inhibited and/or 

altered carotenoid production. 
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Another object of this invention is to secure the expression of eukaryotic carotenoid- 
related genes in a recombinant prokaryotic host. 

Yet another object of the present invention is to provide a method for screening for 
eukaryotic and prokaryotic genes and cDNAs which encode enzymes involved in carotenoid 
5 biosynthesis and metabolism. 

An additional object of the invention is to provide a method for manipulating 
carotenoid biosynthesis in photosynthetic organisms by inhibiting the synthesis of certain 
en2ymatic products to cause accumulation of precursor compounds. 

Another object of the invention is to provide modified lycopene e-cyclase, IPP 
10 isomerase and P -carotene hydroxylase. 

These and other objects of the present invention have been realized by the present 

inventors as described below. 

A subject of the present invention is an isolated and/or purified nucleic acid sequence 
which encodes for a protein having lycopene e-cyclase, IPP isomerase or p-carotene 
1 5 hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS: 2, 4, 14- 
21 or 23-27. 

The invention also includes vectors which comprise any of the nucleic acid sequences 
listed above, and host cells transformed with such vectors. 

Another subject of the present invention is a method of producing or enhancing the 
20 production of a carotenoid in a host cell, comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which encodes for a protein having 
lycopene e-cyclase, IPP isomerase or p-carotene hydroxylase enzyme activity, wherein the 
heterologous nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence to produce the protein. 
25 Yet another subject of the present invention is a method of modifying the production 

of carotenoids in a host cell, the method comprising inserting into the host cell a vector 
comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for 
a protein which modifies lycopene e-cyclase, IPP isomerase or p-carotene hydroxylase 
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid 
30 sequence is operably linked to a promoter, and expressing the heterologous nucleic acid 

sequence in the host cell to modify the production of the carotenoids in the host cell, relative 
to the untransformed host cell. 
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The present invention also includes a method of expressing, in a host cell, a 
heterologous nucleic acid sequence which encodes for a protein having lycopene e-cyclase, 
IPP isomerase or p-carotene hydroxylase enzyme activity, the method comprising inserting 
into the host cell a vector comprising the heterologous nucleic acid sequence, wherein the 
5 heterologous nucleic acid sequence is operably linked to a promoter; and expressing the 
heterologous nucleic acid sequence. 

Also included is a method of expressing, in a host cell, a heterologous nucleic acid 
sequence which encodes for a protein which modifies lycopene e-cyclase, IPP isomerase or 
p-carotene hydroxylase enzyme activity in the host cell, relative to an untransformed host 
1 0 cell, the method comprising inserting into the host cell a vector comprising the heterologous 
nucleic acid sequence, wherein the heterologous nucleic acid sequence is operably linked to a 
promoter; and expressing the heterologous nucleic acid sequence. 

Another subject of the present invention is a method for screening for genes and 
cDNAs which encode enzymes involved in carotenoid biosynthesis and metabolism. 

15 TtTMFF PFSCPTPTTON OF TKF. DK AWTN GS 

A more complete appreciation of the invention and many of the attendant advantages 
thereof will be readily obtained as the same becomes better understood by reference to the 
following detailed description when considered in connection with the accompanying 
drawings, wherein: 

20 Figure 1 is a schematic representation of the putative pathway of p -carotene 

biosynthesis in cyanobacteria, algae and plants. The enzymes catalyzing various steps are 
indicated at the left. Target sites of the bleaching herbicides NFZ and MPTA are also 
indicated at the left. Abbreviations: DMAPP, dimethylallyl pyrophosphate; FPP, farnesyl 
pyrophosphate; GGPP, geranylgeranyl pyrophosphate; GPP, geranyl pyrophosphate; IPP, 
25 isopentenyl pyrophosphate; LCY, lycopene cyclase; MVA, mevalonic acid; MPTA, 2-(4- 
methylphenoxy)triethylamine hydrochloride; NFZ, norflurazon; PDS, phytoene desaturase; 
PSY, phytoene synthase; ZDS, C-carotene desaturase; PPPP, prephytoene pyrophosphate. 

Figure 2 depicts possible routes of synthesis of cyclic carotenoids and common plant 
and algal xanthophylls (oxycarotenolds) from neurosporene. Demonstrated activities of the 
30 p- and e-cyclase enzymes of A. thaliana are indicated by bold arrows labelled with P or e 
respectively. A bar below the arrow leading to e-carotene indicates that the enzymatic 
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activity was examined but no product was detected. The steps marked by an arrow with a 
dotted line have not been specifically examined. Conventional numbering of the carbon 
atoms is given for neurosporene and a-carotene. Inverted triangles (▼) mark positions of the 
double bonds introduced as a consequence of the desaturation reactions. 
5 Figure 3 depicts the carotene endgroups which are found in plants. 

Figure 4 is a DNA sequence and the predicted amino acid sequence of a lycopene e- 
cyclase cDNA isolated from A. thaliana (SEQ ID NOS: 1 and 2). These sequences were 
deposited under Genbank accession number U50738. This cDNA is incorporated into the 
plasmid pATeps. 

10 Figure 5 is a DNA sequence encoding the p -carotene hydroxylase isolated from A. 

thaliana (SEQ ID NO: 3). This cDNA is incorporated into the plasmid pATOHB. 

Figure 6 is an alignment of the predicted amino acid sequences of A. thaliana P- 
carotene hydroxylase (SEQ ED NO: 4) with those of the bacterial p -carotene hydroxylase 
enzymes from Alicalgenes sp. (SEQ ID NO: 5) (Genbank D58422), Erwinia herbicola EholO 
1 5 (SEQ ID NO.: 6) (GenBank M872280), Erwinia uredovora (SEQ ID NO.: 7) (GenBank 
D90087) and Agrobacterium aurianticum (SEQ ID NO.: 8) (GenBank D58420). A 
consensus sequence is also shown. All five genes are identical where a capital letter appears 
in the consensus. A lowercase letter indicates that three of five, including A. thaliana, have 
the identical residue. TM; transmembrane. 
20 Figure 7 is a DNA sequence of a cDNA encoding an IPP isomerase isolated from A. 

thaliana (SEQ ID NO: 9). This cDNA is incorporated into the plasmid pATDPS. 

Figure 8 is a DNA sequence of a second cDNA encoding another IPP isomerase 
isolated from A. thaliana (SEQ ID NO: 10). This cDNA is incorporated into the plasmid 
pATDP7. 

25 Figure 9 is a DNA sequence of a cDNA encoding an TPP isomerase isolated from 

Haematococcus pluvialis (SEQ ID NO: 1 1). This cDNA is incorporated into the plasmid 
pHP04. 

Figure 10 is a DNA sequence of a second cDNA encoding another IPP isomerase 
isolated from Haematococcus pluvialis (SEQ ID NO: 12). This cDNA is incorporated into 

30 the plasmid pHP05. 

Figure 1 1 is an alignment of the amino acid sequences predicted by IPP isomerase 
cDNAs isolated from A. thaliana (SEQ ID NO.: 16 and 18), H. pluvialis (SEQ ID NOS: 14 
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and 15), Clarkia breweri (SEQ ID NO.: 1 7) (See, Blanc & Pichersky, Plant Physiol. (1995) 
108:855; Genbank accession no. X82627) and Saccharomyces cerevisiae (SEQ ID NO.: 19) 
(Genbank accession no. J05090). 

Figure 12 is a DNA sequence of the cDNA encoding an EPP isomerase isolated from 
5 Tagetes erecta (marigold; SEQ ID NO: 13). This cDNA is incorporated into the plasmid 

pPMDPl . xxx's denote a region not originally sequenced. Figure 21 A shows the complete 
marigold sequence. 

Figure 13 is an alignment of the consensus sequence of four plant P-cyclases (SEQ ID 
NO.: 20) with the A. thaliana lycopene e-cyclase (SEQ ID NO.: 21). A capital letter in the 
10 plant p consensus is used where all four p -cyclase genes predict the same amino acid residue 
in this position. A small letter indicates that an identical residue was found in three of the 
four. Dashes indicate that the amino acid residue was not conserved and dots in the sequence 
denote a gap. A consensus for the aligned sequences is given, in capital letters below the 
alignment, where the P- and e -cyclases have the same amino acid residue. Arrows indicate 
1 5 some of the conserved amino acids that will be used as junction sites for construction of 
chimeric cyclases with novel enzymatic activities. Several regions of interest including a 
sequence signature indicative of a dinucleotide-binding motif and two predicted 
transmembrane (TM) helical regions are indicated below the alignment and are underlined. 

Figure 14 shows the nucleotide (SEQ ID NO:22) and amino acid sequences (SEQ ID 
20 NO:23) of the Adonis palaestina (pheasant's eye) e-cyclase cDNA #5. 

Figure 1 5 A shows the nucleotide (SEQ ID NO:24) and amino acid sequences (SEQ 
ID NO:25) of a potato e-cyclase cDNA. Figure 15B shows the amino acid sequence (SEQ ID 
NO:26) of a chimeric lettuce/potato lycopene e-cyclase. Amino acids in lower case are from 
the lettuce cDNA and those in upper case are from the potato cDNA. The product of this 
25 chimeric cDNA has e-cyclase activity and converts lycopene to the monocyclic 5-carotene. 

Figure 16 shows a comparison between the amino acid sequences of the Arabidopsis 
e-cyclase (SEQ ID NO:27) and the potato e-cyclase (SEQ ID NO:25). 

Figure 17A shows the nucleotide sequence of the Adonis palaestina Ipil (SEQ ID 
NO:28) and Figure 17B shows the nucleotide sequence of the Adonis palaestina Ipi2 (SEQ 
30 ID NO: 29). 
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Figure 1 8 A shows the nucleotide sequence of the Haematoccus pluvialis Ipil (SEQ 
ID NO:l 1) and Figure 1 8B shows the nucleotide sequence of the Haematoccus pluvialis Ipi2 
(SEQ ID NO:30). 

Figure 19A shows the nucleotide sequence of the Lactuca sativa (romaine lettuce) 
5 Ipi 1 (SEQ ID NO:3 1 ) and Figure 1 9B shows the nucleotide sequence of the Lactuca sativa 

Ipi2 (SEQ ID NO: 32). 

Figure 20 shows the nucleotide sequence of the Chlamydomonas reinhardtii Ipil 

(SEQ ID NO:33). 

Figure 21 A shows the nucleotide sequence of the Tagetes erecta (marigold) Ipil (SEQ 
10 ID NO:34) and Figure 21B shows the nucleotide sequence of the Oryza sativa (rice) Ipil 
(SEQ ID NO:35). 

Figure 22 shows a amino acid sequence alignment of various plant and green algal 
isopentenyl isomerases (IPI) (SEQ ID NOS:16, 36-45). 

Figure 23 shows a comparison between Adonis palaestina e-cyclase cDNA #3 and 
1 5 cDNA #5 nucleotide sequences. 

Figure 24 shows a comparison between Adonis palaestina e-cyclase cDNA #3 and 
cDNA #5 predicted amino acid sequences. 

Figure 25 shows a sequence alignment of various plant p- and e-cyclases. Those 
sequences outlined in grey denote identical sequences among the e-cyclases. Those 
20 sequences outlined in black denote identical sequences among both the p - and e-cyclases. 

Figure 26 shows a sequence alignment of the plant e-cyclases from Figure 25. Those 
sequences outlined in black denote identical sequences among the e-cyclases. 

Figure 27 is a dendrogram or "tree" illustrating the degree of amino acid sequence 
similarity for various lycopene p- and e-cyclases. 
25 Figure 28 shows a comparison between Arabidopsis e-cyclase and lettuce e-cyclase 

predicted amino acid sequences. 

nPSCRTPTIPM OF THF. P PFFFRPFn F.MBODTMENTS 
The present invention includes an isolated and/or purified nucleic acid sequence 
which encodes for a protein having lycopene e-cyclase, IPP isomerase or p-carotene 
30 hydroxylase enzyme activity and having the amino acid sequence of SEQ ID NOS : 2, 4, 1 4- 
21, 23 or 25-27. Nucleic acids encoding lycopene e-cyclase, p-carotene hydroxylase and IPP 
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isomerases have been isolated from several genetically distant sources. 

The present inventors have isolated nucleic acids encoding the enzyme EPP isomerase, 
which catalyzes the reversible conversion of isopentenyl pyrophosphate (IPP) to 
dimethylallyl pyrophosphate (DMAPP). IPP isomerase cDNAs were isolated from the plants 
5 A. thaliana, Tagetes erecta (marigold), Adonis palaestina (pheasant's eye), Lactuca sativa 
(romaine lettuce) and from the green algae H. pluvialis and Chlamydomonas reinhardtii. 
Alignments of the amino acid sequences predicted by some of these cDNAs are shown in 
Figures 12 and 22. Plasmids containing some of these cDNAs were deposited with the 
American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on March 4, 
10 1996 under ATCC accession numbers 98000 (pHP05 - H. pluvialis); 98001 (pMDPl - 
marigold); 98002 (pATDP7 - A. thaliana) and 98004 (pHP04 - K pluvialis). 

The present inventors have also isolated nucleic acids encoding the enzyme P- 
carotene hydroxylase, which is responsible for hydroxylating the P-endgroup in carotenoids. 
The nucleic acid of the present invention is shown in SEQ ED NO: 3 and Figure 5. The full 
15 length cDNA product hydroxylates both end groups of p -carotene as do products of cDNAs 
which encode proteins truncated by up to 50 amino acids from the N-terminus. Products of 
genes which encode proteins truncated between about 60-1 1 0 amino acids from the N- 
terminus preferentially hydroxylate only one ring. A plasmid containing this gene was 
deposited with the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 
20 20852 on March 4, 1996 under ATCC accession number 98003 (pATOHB - A. thaliana). 

The present inventors have also isolated nucleic acids encoding the enzyme lycopene 
e-cyclase, which is responsible for the formation of e-endgroups in carotenoids. The A. 
thaliane e-cyclase adds an e ring to only one end of the symmetrical lycopene while the 
related P -cyclase adds a ring at both ends. The A. thaliana cDNA of the present invention is 
25 shown in Figure 4 and SEQ ID NO: 1 . A plasmid containing this gene was deposited with 
the American Type Culture Collection, 12301 Parklawn Drive, Rockville MD 20852 on 
March 4, 1996 under ATCC accession number 98005 (pATeps - A. thaliana). 

In addition, lycopene e-cyclases have been identified in lettuce and in Adonis 
palaestina (cDNA #5) which encode enzymes that convert lycopene to the bicyclic e- 
30 carotene (e,e-carotene). An additional cDNA from Adonis palaestina (cDNA #3) encodes a 
lycopene e-cyclase which converts lycopene into 6-carotene (e,t|/ -carotene) and differs from 
the lycopene e cyclase which forms bicyclic e-carotene (e,e-carotene) by only 5 amino acids. 
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One or more of these amino acids may be modified by alteration of the nucleotide sequence 
in the #5 cDNA to obtain an enzyme which forms the bicyclic e,e-carotene. The sequences 
of the Adonis palaestina and Arabidopsis thaliana €-cyclases have about 70% nucleotide 
identity and about 72% amino acid identity. 

5 Initial experiments by the inventors with chimeric genes indicated that the part of the 

e-cyclase which is responsible for adding 2 e rings to form e,e-carotene is the carboxy 
terminal portion of the gene. The lettuce e-cyclase adds two e rings to form e,e-carotene. A 
DNA encoding a partial potato e-cyclase (missing its amino terminal portion), when 
combined with an amino terminal region from the lettuce e-cyclase gene, produces a 

10 monocyclic 6-carotene (e,\Jf -carotene). With the discovery of the differences between the 

Adonis palaestina clone #3 and clone #5, the specific amino acids responsible for the addition 
of an extra e ring have been identified (Figure 24). Specifically, amino acid 55 is Thr in 
clone #3 and Ser in clone #5, amino acid 210 is Asn in clone #3 and Asp in clone #5, amino 
acid 231 is Asp in clone #3 and Glu in clone #5, amino acid 352 is He in clone #3 and Val in 

1 5 clone #5, and amino acid 524 is Lys in clone #3 and Arg in clone #5. It can be appreciated 

that these changes are quite conservative, as only one change, at amino acid 210, changes the 
charge of the protein. 

Thus, it is clear that the nucleic acids of the invention encoding the enzymes as 
presently disclosed may be altered to increase a particularly desirable property of the enzyme, 

20 to change a property of the enzyme, or to diminish an undesirable property of the enzyme. 

Such modifications can be by deletion, substitution, or insertion of one or more amino acids, 
and can be performed by routine enzymatic manipulation of the nucleic acid encoding the 
enzyme (such as by restriction enzyme digestion, removal of nucleotides by mung bean 
nuclease or BaB\ 9 insertion of nucleotides by Klenow fragment, and by religation of the 

25 ends), by site-directed mutagenesis, or may be accidental, such as by low fidelity PCR or 
those obtained through mutations in hosts that are producers of the enzymes. These 
techniques as well as other suitable techniques are well known in the art. 

Mutations can be made in the nucleic acids of the invention such that a particular 
codon is changed to a codon which codes for a different amino acid. Such a mutation is 

30 generally made by making the fewest nucleotide changes possible. A substitution mutation 
of this sort can be made to change an amino acid in the resulting protein in a non- 
conservative manner (i.e., by changing the codon from an amino acid belonging to a grouping 
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of amino acids having a particular size or characteristic to an amino acid belonging to another 
grouping) or in a conservative manner (i.e., by changing the codon from an amino acid 
belonging to a grouping of amino acids having a particular size or characteristic to an amino 
acid belonging to the same grouping). Such a conservative change generally leads to less 

5 change in the structure and function of the resulting protein. A non-conservative change is 
more likely to alter the structure, activity or function of the resulting protein. The present 
invention should be considered to include sequences containing conservative changes which 
do not significantly alter the activity or binding characteristics of the resulting protein. 
The following is one example of various groupings of amino acids: 

10 Amino acids with nonp olar R groups : Alanine, Valine, Leucine, Isoleucine, Proline, 
Phenylalanine, Tryptophan and Methionine. 

Amino acids with uncharg ed polar R groups : Glycine, Serine, Threonine, Cysteine, Tyrosine, 
Asparagine and Glutamine. 

Amino acids with charg ed polar R groups (negatively charged at Ph 6.0): Aspartic acid and 

15 Glutamic acid. 

ttasic amino acids (positively charged at pH 6.0): Lysine, Arginine and Histidine. 

Another grouping may be those amino acids with phenyl groups: Phenylalanine, 
Tryptophan and Tyrosine. 

Another grouping may be according to molecular weight (i.e., size of R groups). 
20 Particularly preferred substitutions are: 

- Lys for Arg and vice versa such that a positive charge may be maintained; 

- Glu for Asp and vice versa such that a negative charge may be maintained; 

- Ser for Thr such that a free -OH can be maintained; and 

- Gin for Asn such that a free NH 2 can be maintained. 

25 Amino acid substitutions may also be introduced to substitute an amino acid with a 

particularly preferable property. For example, a Cys may be introduced to provide a potential 
site for disulfide bridges with another Cys. A His may be introduced as a particularly 
"catalytic" site (i.e., His can act as an acid or base and is the most common amino acid in 
biochemical catalysis). Pro may be introduced because of its particularly planar structure, 

30 which induces P -turns in the protein's structure. 

It is clear that certain modifications of SEQ ID NOS: 2, 4, 14-21, 23 or 25-27 can take 
place without destroying the activity of the enzyme. It is noted especially that truncated 
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versions of the nucleic acids of the invention are functional. For example, several amino 
acids (from 1 to about 120) can be deleted from the N-terminus of the lycopene € -cyclases of 
the invention, and a functional protein can still be produced. This fact is made especially 
clear from Figure 25, which shows a sequence alignment of several plant e-cyclases. As can 
5 be seen from Figure 25, there is an enormous amount of sequence disparity between amino 
acid sequences 2 to about 50-70 (depending on the particular sequence, since gaps are 
present). There is less, but also a substantial amount of, sequence dissimilarity between about 
50-70 to about 90-120 (depending on the particular sequence). Thereafter, the sequences are 
fairly conserved, except for small pockets of dissimilarity between about 275-295 to about 

10 285-305 (depending on the particular sequence), and between about 395-415 to about 410- 
430 (depending on the particular sequence). 

The present inventors have found that the amount of the 5* region present in the 
nucleic acids of the invention can alter the activity of the enzyme. Instead of diminishing 
activity, truncating the 5' region of the nucleic acids of the invention may result in an enzyme 

15 with a different specificity. Thus, the present invention relates to nucleic acids and enzymes 
encoded thereby which are truncated to within 0-50, preferably 0-25, codons of the 5* 
initiation codon of their prokaryotic counterparts as determined by alignment maps as 
discussed below. 

For example, when the cDNA encoding A. thaliana p -carotene hydroxylase was 
20 truncated, the resulting enzyme catalyzed the formation of p-cryptoxanthin as the major 

product and zeaxanthin as minor product; in contrast to its normal production of zeaxanthin. 

The present invention is intended to include those nucleic acid and amino acid 
sequences in which substitutions, deletions, additions or other modifications have taken 
place, as compared to SEQ ID NOS: 2, 4, 14-21, 23 or 25-27, without destroying the activity 
25 of the enzyme. Preferably, the substitutions, deletions, additions or other modifications take 
place at the 5' end, or any other of those positions which already show dissimilarity between 
any of the presently disclosed amino acid sequences (see also Figure 25) or other amino acid 
sequences which are known in the art and which encode the same enzyme (i.e., lycopene e- 
cyclase, IPP isomerase or P -carotene hydroxylase). 
30 In each case, nucleic acid and amino acid sequence similarity and identity is measured 

using sequence analysis software, for example, the Sequence Analysis, Gap, or BestFit 
software packages of the Genetics Computer Group (University of Wisconsin Biotechnology 
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Center, 1710 University Avenue, Madison, Wisconsin 53705), MEGAlign (DNAStar, Inc., " 
1228 S. Park St., Madison, Wisconsin 53715), or MacVector (Oxford Molecular Group, 2105 
S. Bascom Avenue, Suite 200, Campbell, California 95008). Such software uses algorithms 
to match similar sequences by assigning degrees of identity to various substitutions, 
5 deletions, and other modifications, and includes detailed instructions as to useful parameters, 
etc., such that those of routine skill in the art can easily compare sequence similarities and 
identities. An example of a useful algorithm in this regard is the algorithm of Needleman and 
Wunsch, which is used in the Gap program discussed above. This program finds the 
alignment of two complete sequences that maximizes the number of matches and minimizes 

10 the number of gaps. Another useful algorithm is the algorithm of Smith and Waterman, 
which is used in the BestFit program discussed above. This program creates an optimal 
alignment of the best segment of similarity between two sequences. Optimal alignments are 
found by inserting gaps to maximize the number of matches using the local homology 
algorithm of Smith and Waterman. 

15 Conservative (i.e. similar) substitutions typically include substitutions within the 

following groups: glycine and alanine; valine, isoleucine and leucine; aspartic acid, glutamic 
acid, asparagine and glutamine; serine and threonine; lysine and arginine; and phenylalanine 
and tyrosine. Substitutions may also be made on the basis of conserved hydrophobicity or 
hydrophilicity (see Kyte and Doolittle, J. Mol Biol 157: 105-132 (1982)), or on the basis of 

20 the ability to assume similar polypeptide secondary structure (see Chou and, Fasman, Adv. 
Enzymol. 47: 45-148 (1978)). 

If comparison is made between nucleotide sequences, preferably the length of 
comparison sequences is at least 50 nucleotides, more preferably at least 60 nucleotides, at 
least 75 nucleotides or at least 100 nucleotides. It is most preferred if comparison is made 

25 between the nucleic acid sequences encoding the enzyme coding regions necessary for 

enzyme activity. If comparison is made between amino acid sequences, preferably the length 
of comparison is at least 20 amino acids, more preferably at least 30 amino acids, at least 40 
amino acids or at least 50 amino acids. It is most preferred if comparison is made between 
the amino acid sequences in the enzyme coding regions necessary for enzyme activity. 

30 It should be appreciated that also within the scope of the present invention are nucleic 

acid sequences encoding lycopene e-cyclases, IPP isomerases and p -carotene hydroxylases 
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which code for enzymes having the same amino acid sequence as SEQ ID NOS: 2, 4, 14-21, 
23 or 25-27, but which are degenerate to the nucleic acids specifically disclosed herein. 

The amino acid residues described herein are preferred to be in the "L" isomeric form. 
However, residues in the "D" isomeric form can be substituted for any L-amino acid residue, 
5 as long as the desired functional property of immunoglobulin-binding is retained by the 
polypeptide. 

In accordance with the present invention there may be employed conventional 
molecular biology, microbiology, and recombinant DNA techniques within the skill of the 
art. Such techniques are explained fully in the literature. See, e.g., Sambrook et al, 
10 "Molecular Cloning: A Laboratory Manual" (1989); "Current Protocols in Molecular 
Biology" Volumes Mil [Ausubel, R. M., ed. (1994)]; "Cell Biology: A Laboratory 
Handbook" Volumes I-III [J. E. Celis, ed. (1994))]; "Current Protocols in Immunology" 
Volumes I-III [Coligan, J. E., ed. (1994)]; "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); 
"Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; "Transcription And 
15 Translation" [B.D. Hames & SJ. Higgins, eds. (1984)]; "Animal Cell Culture" [R.I. 

Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A 
Practical Guide To Molecular Cloning" (1984). 

The present invention also includes vectors. Suitable vectors according to the present 
invention comprise a nucleic acid of the invention encoding an enzyme involved in 
20 carotenoid biosynthesis or metabolism and a suitable promoter for the host, and can be 

constructed using techniques well known in the art (for example Sambrook et al., Molecular 
Cloning A Laboratory Manual . Cold Spring Harbor Laboratory, Cold Spring Harbor, NY, 
1989; Ausubel et al., Current Protocols in Molecular Biology . Greene Publishing and Wiley 
Interscience, New York, 1991). Suitable vectors for eukaryotic expression in plants are 
25 described in Frey et al., Plant J. (1995) 8(5):693 and Misawa et al, 1994a; incorporated herein 
by reference. Suitable vectors for prokaryotic expression include pACYCl 84, pUCl 19, and 
pBR322 (available from New England BioLabs, Bevery, MA) and pTrcHis (Invitrogen) and 
pET28 (Novagen) and derivatives thereof. The vectors of the present invention can 
additionally contain regulatory elements such as promoters, repressors, selectable markers 
30 such as antibiotic resistance genes, etc. 

The nucleic acids encoding the carotenoid enzymes as described above, when cloned 
into a suitable expression vector, can be used to overexpress these enzymes in a plant 
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expression system or to inhibit the expression of these enzymes. For example, a vector 
containing the gene encoding lycopene e-cyclase can be used to increase the amount of a- 
carotene and carotenoids derived from a-carotene (such as lutein and a-cryptoxanthin) in an 
organism and thereby alter the nutritional value, pharmacology and visual appearance value 
5 of the organism. 

Therefore, the present invention includes a method of producing or enhancing the 
production of a carotenoid in a host cell, relative to an untransformed host cell, the method 
comprising inserting into the host cell a vector comprising a heterologous nucleic acid 
sequence which encodes for a protein having lycopene €-cyclase, 1PP isom erase or P- 

10 carotene hydroxylase enzyme activity, wherein the heterologous nucleic acid sequence is 
operably linked to a promoter; and expressing the heterologous nucleic acid sequence to 
produce the protein. 

The present invention also includes a method of modifying the production of 
carotenoids in a host cell, the method comprising inserting into the host cell a vector 

1 5 comprising a heterologous nucleic acid sequence which produces an RNA and/or encodes for 
a protein which modifies lycopene €-cyclase, IPP isomerase or p -carotene hydroxylase 
enzyme activity, relative to an untransformed host cell, wherein the heterologous nucleic acid 
sequence is operably linked to a promoter; and expressing the heterologous nucleic acid 
sequence in the host cell to modify the production of the carotenoids in the host cell, relative 

20 to the untransformed host cell. 

The term "modifying the production" means that the amount of carotenoids produced 
in the host cell can be enhanced, reduced, or left the same, as compared to the untransformed 
host cell. In accordance with one embodiment of the present invention, the make-up of the 
carotenoids (i.e., the specific carotenoids produced) is changed vis a vis each other, and this 

25 change in make-up may result in either a net gain, net loss, or no net change in the total 

amount of carotenoids produced in the cell. In accordance with another embodiment of the 
present invention, the production or the biochemical activity of the carotenoids (or the 
enzymes which catalyze their formation) is enhanced by the insertion of an enzyme-encoding 
nucleic acid of the invention. In yet another embodiment of the invention, the production or 

30 the biochemical activity of the carotenoids (or the enzymes which catalyze their formation) 
may be reduced or inhibited by a number of different approaches available to those skilled in 
the art, including but not limited to such methodologies or approaches as anti-sense (e.g., 

-15- 



BNSDOCID: <WO 98B30S5A1 | A > 



WO 99/63055 




PCT/US99/12121 



Gray et al (1992) Plant Mol. BioL 19:69-87), ribozymes (e.g., Wegener et al (1994) Mol. 
Gen. Genet. 245:465-470), co-suppression (e.g., Fray and Grierson (1993) Plant Mol. Biol. 
22:589-602), targeted disruption of the gene (e.g., Schaefer et al. (1997) Plant J. 1 1 :1 195- 
1206), intracellular antibodies (e.g., Rondon and Marasco (1997) Ann. Rev. Microbiol. 
5 51 :257-283) or whatever other approaches rely on the knowledge or availability of the nucleic 
acid or amino acid sequences of the invention and/or portions thereof, to thereby reduce 
accumulation of carotenoids with e rings and compounds derived from them (for e-cyclase 
inhibition), or carotenoids with hydroxylated P rings and compounds derived from them (for 
P -hydroxylase inhibition), or, in the case if IPP isomerase, accumulation of any isoprenoid 
10 compound. 

Preferably, at least a portion of the nucleic acid sequences used in the methods, 
vectors and host cells of the invention codes for an enzyme having an amino acid sequence 
which is at least 85% identical, preferably at least 90%, at least 95% or completely identical 
to SEQ ED NOS: 2, 4, 14-21, 23 or 25-27. Sequence identity is determined as noted above. 

15 Preferably, sequence additions, deletions or other modifications are made as indicated above, 
so as to not affect the function of the particular enzyme. 

In a preferred embodiment, vectors are manufactured which contain a DNA encoding 
a eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic carotenoid 
enzyme. The inventors have discovered that inclusion of an IPP isomerase gene increases the 

20 supply of substrate for the carotenoid pathway; thereby enhancing the production of 

carotenoid endproducts, as compared to a host cell which is not transformed with such a 
vector. This is apparent from the much deeper pigmentation in carotenoid-accumulating 
colonies of E. coli which also contain one of the aforementioned IPP isomerase genes when 
compared to colonies that lack this additional IPP isomerase gene. Similarly, a vector 

25 comprising an IPP isomerase gene can be used to enhance production of any secondary 
metabolite of dimethylallyl pyrophosphate and/or isopentenyl pyrophosphate (such as 
isoprenoids, steroids, carotenoids, etc.). The term "isoprenoid" is intended to mean any 
member of the class of naturally occurring compounds whose carbon skeletons are composed, 
in part or entirely, of isopentyl C 5 units. Preferably, the carbon skeleton is of an essential oil, 

30 a fragrance, a rubber, a carotenoid, or a therapeutic compound, such as paclitaxel. 

A vector containing the cDNA encoding a lycopene e-cyclase of the invention, 
preferably the lettuce lycopene e-cyclase or Adonis e-cyclase #5, can be used to increase the 
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amount of bicyclic e-carotene in an organism and thereby alter the nutritional value, 
pharmacology and visual appearance value of the organism. In addition, the transformed 
organism can be used in the formulation of therapeutic agents, for example in the treatment of 
cancer (see Mayne et al (1996) FASEB J. 10:690-701; Tsushima et al (1995) Biol. Pharm. 
5 Bull. 18:227-233). 

An antisense strand of a nucleic acid of the invention can be inserted into a vector. 
For example, the lycopene e -cyclase gene can be inserted into a vector and incorporated into 
the genomic DNA of a host, thereby inhibiting the synthesis of e,p -carotenoids (lutein and a- 
carotene) and enhancing the synthesis of P,p -carotenoids (zeaxanthin and P -carotene). 

10 The present invention also relates to novel enzymes which are encoded by the amino 

acid sequences of the invention, or portions thereof. 

The present invention also relates to novel enzymes which can transform known 
carotenoids into novel or uncommon products. Currently e-carotene (see Figure 2) and y- 
carotene are commonly produced only in minor amounts. As described below, an enzyme 

15 can be produced which transforms lycopene to Y-carotene and lycopene to e-carotene.; With 
these products in hand, bulk synthesis of other carotenoids derived from them are possible. 
For example, e-carotene can be hydroxylated to form lactucaxanthin, an isomer of lutein (one 
e and one p ring) and zeaxanthin (two P rings) where both endgroups are, instead, e rings. 

In addition to novel enzymes produced by truncating the 5' region of known enzymes, 

20 as discussed above, novel enzymes which can participate in the formation of unusual 

carotenoids can be formed by replacing portions of one gene with an analogous sequence 
from a structurally related gene. For example, P-cyclase and e-cyclase are structurally 
related (see Figure 13). By replacing a portion of P -lycopene cyclase with the analogous 
portion of e-cyclase, an enzyme which produces y-carotene will be produced (one P 

25 endgroup). Further, by replacing a portion of the lycopene e-cyclase with the analogous 
portion of P-cyclase, an enzyme which produces e-carotene will be produced (with some 
exceptions, such as the lettuce e-cyclase, plant e-cyclases normally produce a compound with 
one e-endgroup, 6-carotene). Similarly, p -hydroxylase could be modified to produce 
enzymes of novel function by creation of hybrids with e-hydroxylase. 

30 Host systems according to the present invention can comprise any organism that 

already produces carotenoids or which has been genetically modified to produce carotenoids. 
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The IPP isomerase genes are more broadly applicable for enhancing production of any 
product dependent on DMAPP and/or IPP as a precursor. 

Organisms which already produce carotenoids include plants, algae, some yeasts, 
fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
5 with vectors according to the present invention can be done using standard techniques such as 
those described in Misawa et al., (1990) supra; Hundle et al., (1993) supra; Hundle et al., 
(1991) supra; Misawa et al., (1991) supra; Sandmann et al., supra; and Schnurr et al., supra. 

Transgenic organisms can be constructed which include the nucleic acid sequences of 
the present invention (Bird et al, 1991; Bramley et al, 1992; Misawa et al, 1994a; Misawa et 
10 al, 1994b; Cunningham et al, 1993). The incorporation of these sequences can allow the 
controlling of carotenoid biosynthesis, content, or composition in the host cell. These 
transgenic systems can be constructed to incorporate sequences which allow for the 
overexpression of the nucleic acids of the present invention. Transgenic systems can also be 
constructed containing antisense expression of the nucleic acid sequences of the present 
15 invention. Such antisense expression would result in the accumulation of the substrates of the 
substrates of the enzyme encoded by the sense strand. 

A method for screening for eukaryotic genes which encode enzymes involved in 
carotenoid biosynthesis comprises transforming a prokaryotic host with a nucleic acid which 
may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; culturing said 
20 transformed host to obtain colonies; and screening for colonies exhibiting a different color 
than colonies of the untransformed host. 

Suitable hosts include E. coli, cyanobacteria such as Synechococcus and 
Synechocystis, alga and plant cells. E. coli are preferred. 

In a preferred embodiment, the above "color complementation" screening protocol can 
25 be enhanced by using mutants which are either (1) deficient in at least one carotenoid 

biosynthetic gene or (2) overexpress at least one carotenoid biosynthetic gene. In either case, 
such mutants will accumulate carotenoid precursors. 

Prokaryotic and eukaryotic DNA or cDNA libraries can be screened in total for the 
presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred 
30 organisms to be screened include photosynthetic organisms. 
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E. coli can be transformed with these eukaryotic cDNA libraries using conventional 
methods such as those described in Sambrook et al, 1989 and according to protocols 
described by the vendors of the cloning vectors. 

For example, the cDNA libraries in bacteriophage vectors such as lambdaZAP 
5 (Stratagene) or lambda ZIPLOX (Gibco BRL) can be excised en masse and used to transform 
E.coli. 

Transformed E. coli can be cultured using conventional techniques. The culture broth 
preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics include 
penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 15-40°C, 

10 preferably at room temperature or slightly above (18-28°C), for 12 hours to 7 days. 

Cultures are plated and the plates are screened visually for colonies with a different 
color than the colonies of the host E. coli transformed with the empty plasmid cloning vector. 
For example, E. coli transformed with the plasmid, pAC-BETA (described below), produce 
yellow colonies that accumulate p -carotene. After transformation with a cDNA library, 

15 colonies which contain a different hue than those formed by E. coli/p AC-BET A would be 
expected to contain enzymes which modify the structure or accumulation of p -carotene. 
Similar E. coli strains can be engineered which accumulate earlier products in carotenoid 
biosynthesis, such as lycopene, y -carotene, etc. 

Having generally described this invention, a further understanding can be obtained by 

20 reference to certain specific examples which are provided herein for purposes of illustration 
only and are not intended to be limiting unless otherwise specified. 

EXAMPLE 

L Isolation of P -carotene hydroxylase 
Plasmid Construction 

25 An 8.6kb BgUI fragment containing the carotenoid biosynthetic genes of Erwinia 

herbicola was first cloned in the BamHI site of plasmid vector pACYC184 (chloramphenicol 
resistant), and then a 1.1 kb BamHI fragment containing the E. herbicola p -carotene 
hydroxylase (CrtZ) was deleted. E.coli strains containing the resulting plasmid, p AC-BETA, 
accumulate B-carotene and form yellow colonies (Cunningham et al., 1994). 

30 A full length cDNA encoding IPP isomerase of Haematococcus pluvialis (HP04) was 

first excised with BamHI and Kpnl from pBluescript SK-, and then ligated into the 
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corresponding sites of the pTrcHisA vector with high-level expression from the trc promoter 
(Invitrogen, Inc.). A fragment containing the IPP isomerase and trc promoter was 
subsequently excised with EcoKW and Kpn\ 9 treated with the Klenow fragment of DNA 
polymerase to produce blunt ends, and ligated in the Kl enow-treated Hindlll site of p AC- 
BETA. E.coli cells transformed with this new plasmid pAC-BETA-04 form orange colonies 
on LB plates (v5. yellow for those containing pAC-BETA) and cultures accumulate 
substantially more P -carotene (ca. two fold) than those that contain pAC-BETA. 



Screening of an Arabidopsis cDNA Library 

Several X cDNA expression libraries of Arabidopsis were obtained from the 

10 Arabidopsis Biological Resource Center (Ohio State University, Columbus, OH) (Kieber et 
al., 1993). The k cDNA libraries were excised in vivo using Stratagene's ExAssist SOLR 
system to produce a phagemid cDNA library wherein each phagemid contained also a gene 
conferring resistance to the antibiotic ampicillin. 

E.coli strain DH10BZIP was chosen as the host cell for the screening and pigment 

15 production, although we have also used TOPI OF' and XL 1 -Blue for this purpose. DH10B 
cells were transformed with plasmid pAC-BETA-04 and were plated on LB agar plates 
containing chloramphenicol at 50 |ig/ml (from United States Biochemical Corporation). The 
phagemid Arabidopsis cDNA library was then introduced into DH10B cells already 
containing pAC-BETA-04. Transformed cells containing both pAC-BETA-04 and 

20 Arabidopsis cDNA library phagemids were selected on chloramphenicol plus ampicillin (150 
Hg/ml) agar plates. Maximum color development occurred after 3 to 7 days incubation at 
room temperature, and the rare bright yellow colonies were selected from a background of 
many thousands of orange colonies on each agar plate. Selected colonies were inoculated 
into 3 ml liquid LB medium containing ampicillin and chloramphenicol, and cultures were 

25 incubated at room temperature for 1-2 days, with shaking. Cells were then harvested by 
centrifugation and extracted with acetone in micro fiige tubes. After centrifugation, the 
pigmented extract was spotted onto silica gel thin-layer chromatography (TLC) plates, and 
developed with a hexaneiether (1 :1, by volume) mobile phases, fi-carotene hydroxylase- 
encoding cDNAs were identified based on the appearance of a yellow pigment that co- 

30 migrated with zeaxanthin on the TLC plates. 
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Subcloning and Sequencing 

The plasmid containing the fl-carotene hydroxylase cDNA was recovered and 
analyzed by standard procedures (Sambrook et al., 1989). The Arabidopsis B-carotene 
hydroxylase was sequenced completely on both strands on an automatic sequencer (Applied 
5 Biosystems, Model 373 A, Version 2.0. IS). The cDNA insert of 0.95kb also was excised 
and ligated into the a pTrcHis vector. A Bglll restriction site within the cDNA was used to 
remove that portion of the cDNA that encodes the predicted polypeptide N terminal sequence 
region that is not also found in bacterial P-carotene hydroxylases (Figure 6). A Bglll-Xhol 
fragment was directionally cloned in BamHI-XhoI digested TrcHis vectors. 

1 0 Pigment Analysis 

A single colony was used to inoculate 50 ml of LB containing ampicillin and 
chloramphenicol in a 250-ml flask. Cultures were incubated at 28°C for 36 hours with gentle 
shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed once with 
distilled H 2 0 and resuspended with 0.5 ml of water. The extraction procedures and HPLC 

15 were essentially as described previously (Cunningham et al, 1994). 

II. Isolation and biochemical anal ysis of an Arabidopsis lvcopene e-cyclase 
Plasmid Construction 

Construction of plasmids pAC-LYC, pAC-NEUR, and pAC-ZETA is described in 
Cunningham et al., (1994). In brief, the appropriate carotenoid biosynthetic genes from 

20 Erwinia herbicola, Rhodobacter capsulatus, and Synechococcus sp. strain PCC7942 were 

cloned in the plasmid vector pACYCl 84 (New England BioLabs, Beverly, MA). Cultures of 
E. coli containing the plasmids pAC-ZETA, pAC-NEUR, and pAC-LYC, accumulate £- 
carotene, neurosporene, and lycopene, respectively. The plasmid pAC-ZETA was 
constructed as follows: an 8.6-kb Bglll fragment containing the carotenoid biosynthetic genes 

25 of E. herbicola (GenBank M87280; Hundle et al., 1991) was obtained after partial digestion 
of plasmid pPL376 (Perry et al., 1986; Tuveson et aL, 1986) and cloned in the BamHI site of 
pACYC184 to give the plasmid pAC-EHER. Deletion of adjacent 0.8- and 1.1-kb BamHI- 
BamHI fragments (deletion Z in Cunningham et al., 1994), and of a 1.1 kB Sall-Sall fragment 
(deletion X) served to remove most of the coding regions for the E. herbicola P-carotene 

30 hydroxylase (crtZ gene) and zeaxanthin glucosyltransferase (crtX gene), respectively. The 
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resulting plasmid, pAC-BETA, retains functional genes for geranylgeranyl pyrophosphate 
synthase (crtE), phytoene synthase (crtB), phytoene desaturase (crtl), and lycopene cyclase 
(crtY). Cells of E. coli containing this plasmid form yellow colonies and accumulate p- 



5 



carotene. A plasmid containing both the lycopene e- and p -cyclase cDNAs of A. thaliana 
was constructed by excising the e-cyclase in clone y2 as a PvuI-PvuII fragment and ligating 



this piece in the SnaBI site of a plasmid (pSPORT 1 from GIBCO-BRL) that already 
contained the p -cyclase (Cunningham et al., 1996). 

Organisms and Crowth Conditions 

E. coli strains TOP 10 and TOP 10 F' (obtained from Invitrogen Corporation, San 
10 Diego, CA) and XL1 -Blue (Stratagene) were grown in Luria-Bertani (LB) medium 

(Sambrook et al., 1989) at 37°C in darkness on a platform shaker at 225 cycles per min. 

Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl). 

Ampicillin at 150 and/or chloramphenicol at 50 ^ig/mL (both from United States 

Biochemical Corporation) were used, as appropriate, for selection and maintenance of 
15 plasmids. 

Mass Excision and Color Complementation Screening of an A. thaliana cDNA IJhrary 

A size-fractionated 1-2 kB cDNA library of A. thaliana in lambda ZAPII (Kieber et 
aL, 1993) was obtained from the Arabidopsis Biological Resource Center at The Ohio State 
University (stock number CD4-14). Other size fractionated libraries were also obtained 

20 (stock numbers CD4-1 3, CD4- 1 5, and CD4-1 6). An aliquot of each library was treated to 

cause a mass excision of the cDNAs and thereby produce a phagemid library according to the 
instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain XL1- 
Blue and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of E. coli 

25 TOPI 0 F* (this strain contained the plasmid pAC-LYC) by incubation of the phagemid with 
the E. coli cells for 15 min at 37°C. Cells had been grown overnight at 30°C in LB medium 
supplemented with 2% (w/v) maltose and 10 mM MgS0 4 (final concentration), and harvested 
in 1 .5 ml microfuge tubes at a setting of 3 on an Eppendorf microfuge (5415C) for 10 min. 
The pellets were resuspended in 10 mM MgS0 4 to a volume equal to one-half that of the 
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initial culture volume. Transforrnants were spread on large (150 mm diameter) LB agar petri 
plates containing antibiotics to provide for selection of cDNA clones (ampicillin) and 
maintenance of pAC-LYC (chloramphenicol). Approximately 10,000 colony forming units 
were spread on each plate. Petri plates were incubated at 37-C for 16 hr and then at room 
temperature for 2 to 7 days to allow maximum color development. Plates were screened 
visually with the aid of an illuminated 3x magnifier and a low power stage-dissecting 
microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be observed 
in the background of pink colonies. A colony color of yellow or pinkish-yellow was taken as 
presumptive evidence of a cyclization activity. These yellow colonies were collected with 
sterile toothpicks and used to inoculate 3ml of LB medium in culture tubes with overnight 
growth at 37°C and shaking at 225 cycles/min. Cultures were split into two aliquots in 
micro fuge tubes and harvested by centrifugation at a setting of 5 in an Eppendorf 5415C 
microfuge. After discarding the liquid, one pellet was frozen for later purification of plasmid 
DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was resuspended by 
vortex mixing, and extraction was allowed to proceed in the dark for 15-30 min with 
occasional remixing. Insoluble materials were pelleted by centrifugation at maximum speed 
for 10 min in a microfuge. Absorption spectra of the supernatant fluids were recorded from 
350-550 nm with a Perkin Elmer lambda six spectrophotometer. 

Analysis of isolated clones 

Eight of the yellow colonies contained p -carotene indicating that a single gene 
product catalyzes both cyclizations required to form the two p endgroups of the symmetrical 
P -carotene from the symmetrical precursor lycopene. One of the yellow colonies contained a 
pigment with the spectrum characteristic of 6 -carotene, a monocyclic carotenoid with a single 
e endgroup. Unlike the p cyclase, this e-cyclase appears unable to carry out a second 
cyclization at the other end of the molecule. 

The observation that e-cyclase is unable to form two cyclic e-endgroups {e.g. the 
bicyclic e-carotene) illuminates the mechanism by which plants can coordinate and control 
the flow of substrate into carotenoids derived from P -carotene versus those derived from cc- 
carotene and also can prevent the formation of carotenoids with two e endgroups. 

The availability of the A. thaliana gene encoding the e-cyclase enables the directed 
manipulation of plant and algal species for modification of carotenoid content and 
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composition. Through inactivation of the e-cyclase, whether at the gene level by deletion of 
the gene or by insertional inactivation or by reduction of the amount of enzyme formed (by 
such as antisense technology), one may increase the formation of P -carotene and other 
pigments derived from it. Since vitamin A is derived only from carotenoids with p 
5 endgroups, an enhancement of the production of P -carotene versus a -carotene may enhance 
nutritional value of crop plants. Reduction of carotenoids with e-endgroups may also be of 
value in modifying the color properties of crop plants and specific tissues of these plants. 
Alternatively, where production of a -carotene, or pigments such as lutein that are derived 
from a-carotene, is desirable, whether for the color properties, nutritional value or other 

10 reason, one may overexpress the e-cyclase or express it in specific tissues. Wherever 

agronomic value of a crop is related to pigmentation provided by carotenoid pigments the 
directed manipulation of expression of the e-cyclase gene and/or production of the enzyme 
may be of commercial value. 

The predicted amino acid sequence of the A. thaliana e-cyclase enzyme was 

15 determined. A comparison of the amino acid sequences of the p- and e-cyclase enzymes of 
Arabidopsis thaliana (Fig. 13) as predicted by the DNA sequence of the respective cDNAs 
(Fig. 4 for the e-cyclase cDNA sequence), indicates that these two enzymes have many 
regions of sequence similarity, but they are only about 37% identical overall at the amino acid 
level. The degree of sequence identity at the DNA base level, only about 50%, is sufficiently 

20 low such that we and others have been unable to detect this gene by hybridization using the p 
cyclase as a probe in DNA gel blot experiments. 
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We claim: 

1 . An isolated and/or purified nucleic acid sequence which encodes for a protein having 
lycopene e-cyclase enzyme activity and has an amino acid sequence which is at least 85% 
identical to one of SEQ ID NOS: 23 or 25-27. 

5 2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid 
sequence of one of SEQ ID NOS: 23 or 25-27. 

3. A vector comprising the nucleic acid sequence of claim 1, wherein the nucleic acid 
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

10 5 - The host cell of claim 4, wherein the host cell is selected from the group consisting of 
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene e-cyclase enzyme activity and 
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23 or 

15 25-27. 

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS: 23 or 25-27. 
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AMENDED CLAIMS 

[received by the International Bureau on 15 November 1999 (15.1 1.99); 
original claims 1,2,7 and 8 amended; remaining claims unchanged (1 page)] 
1 . An isolated and/or purified nucleic acid sequence which encodes for a protein having 
lycopene e-cyclase enzyme activity and has an amino acid sequence which is at least 85% 
identical to one of SEQ ID NOS: 23, 25 or 26. 

5 2. The nucleic acid sequence of claim 1, wherein the protein has the amino acid 
sequence of one of SEQ ID NOS: 23, 25 or 26. 

3. A vector comprising the nucleic acid sequence of claim 1 , wherein the nucleic acid 
sequence is operably linked to a promoter. 

4. A host cell which contains the vector of claim 3. 

10 5. The host cell of claim 4, wherein the host cell is selected from the group consisting of 
a bacterial cell, an algal cell, a yeast cell and a plant cell. 

6. The host cell of claim 4, wherein the host cell is a photosynthetic cell. 

7. An isolated and/or purified protein having lycopene €-cyclase enzyme activity and 
having an amino acid sequence which is at least 85% identical to one of SEQ ID NOS: 23, 25 

15 or 26. 

8. The protein of claim 7, wherein the protein has the amino acid sequence of one of 
SEQ ID NOS: 23, 25 or 26! 
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Fl G. 4 A 



Arabidopsis thai i ana epsilon cyclase: 

acaaaaggaaataattag attcctctttctgcttgctataccttgaca 48 

gaacaacataacaatggtgtaagtcttctc gctgtattcgaaattatttggaggaggaac 108 

atggagtgtgttggggctaggaatttcgca gcaatggcggtttcaacatttccgtcatgg 168 
M E C V™A ffn FA AM A V S T F P S w 

agttgtcqaaggaaatttccagtggctaag agatacagctataggaatattcgcttcggt 228 
21 SCR^KFPVVK RYbYKNlRFb 

^ ttgtgta^tgtcagagcta^cggcggcgga a^ttccggta^tgaga^ttgtgtagcggtg 288 

agagaagatttcgctgacgaagaagatttt gcgaaagctggcggttctgagattctattt 348 
61 RSDFADEEDF v E A SRI L F 

gttcaaatgcagcagaacaaagatatggat gaacagtctaagcttgttgataagttgcct 408 
81 VQMQQMKD M D S Q S K L V D K L "P 

cctatatcaactggtgatggtgctttggat catgtggttactggctgtggtcctgctggt 468 
101 P I S I ^ D ^ A L D K V V I ^ C % P A ^ 

ttagccttggctgcagaatcagctaagctt ggattaaaagttggactcattggtccagat 528 
121 LALAAKSAK Lb L K V b L lb P D 

cttccttttactaacaattacggtgtttgg gaagatgaattcaatgatcttgggctgcaa 588 
141 L P F T M M Y b V flf K D K F N D L b L G 

aaatgtattgagcatgtttggagagagact attgcgcacctggatgatgacaagcctatt 648 
161 K C I TK V W RT T I V Y LD u D K P I 

^ accatt^ccgtgcttatggaagagtta^t cgacgtttgctccatgaggagcttttgagg 708 

aggtgtgtcgagtcaggtgtctcgtacctt agctcgaaagttgacagcataacagaagct 768 
201 FCVKSlSVSYL SSKVDSITEA 

tgtgatggccttagacttgttgcttgtgac gacaataacgtcattccctgcaggcttgcc 828 
221 SD^LXLVACD DMMVIPCXLA 

actgttgcttctggagcagcttcgggaaag ctcttgcaatacgaagttggtggacctaga 888 
241 nHsTAASlK LLQYXV^^PR 

gtctgtgcgcaaactgcatacggcgtggag gttgaggcggaaaatagtccatatgatcca 948 
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FIG. 4B 

261 VCVQTAYGVX VXVXNSPYDP 

gatcaaatggttttcatggattacagagat tatactaacgagaaagttcggagcttaqaa 1008 
281 DQMVPMDYRD Y T M X X V R 5L A 

gctgagtatccaacgtttctgtacgccatg cctatgacaaagtcaagactcttcttcgag 1068 
301 AKYPTFLYAM PMTKSRLFFK 

gagacatgtttggcrtcaaaagatgtcatg ccrtttgatttgctaaaaacgaagctcatg 1128 
321 KTCLASKDVM PFDLLKTKLM 

341 TTWg*^ 1188 

atcccagttOTtggttccttgccaaacacc gaacaaaagaatctcgcctttggtgctgcc 1248 
361 IPVGGSLPMT X Q K N L A 

™ g^g^tggtacatcccgcaacagqctat tcagttgtgaoatctttgtctgaagctcca 1308 
381 ASMVMPATGY SVVRSLSXAP 

aaacatgcatcagtcatcgcagagatacta agagaagagactaccaaacagattaacagt 1368 
401 KYASVIAKIL REETTKQINS 

aatatttcaagacaagcttaggatacttta tggccaccagaaaggaaaagacagaqaqca 1428 
421 MISRQAWDTL WPPEFXRQRA 

ttctttctctttggtcttgcactcagagtt caattcgataccgaaggcattagaagcttc 1488 
441 FFLFGLALIV QFD tTTi RS F 

ttccqtactttc±tccgccttccaaaatgg atgtggcaagggtttctaggatcaacatta 1548 
461 FRTPFRLPKW MWQGFLGSTL 

acatcaggagatctcgttctc±ttgcttta tacatgttcgtcatttcaccaaacaatttg 1608 
481 TSGDLVLFAL YMPVISPMML 

agaaaaggtctcattaatcatctcatctct gatccaaccggagcaaccatgataaaaacc 1668 
501 RK G L I N W L I S D P T T A T M I K T 

tatctcaaagtatgatttacttaccaactc ttaggtttgtgtatatatatgccgatttat 1728 

521 Y L K V 

ctgaalaatcgatcaaagaatggtatgtgg gttactaggaagttggaaacaaacacgtat 1788 

agaatctaaggagtgatcgaaatggagacg gaaacgaaaagaaaaaaatcagtctttgtt 1648 

ccgtggctagtg 1868 
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FIG. 5 

gctctttctc ctcctcctct accgatttcc gactccgcct cccgaaatcc 

ttatccggat tctctccgtc tcttcgattt aaacgctttt ctgtctgtta 

cgtcgtcgaa gaacggagac agaattctcc gattgagaac gatgagagac 

cggagagcac gagctccaca aacgctatag acgctgagta tctggcgttg 

cgtttggcgg agaaattgga gaggaagaaa tcggagaggt ccacttatct 

aatcgctgct atgttgtcga gctttggtat cacttctatg gctgttatgg 

ctgtttacta cagattctct tggcaaatgg agggaggtga gatctcaatg 

ttggaaatgt ttggtacatt tgctctctct gttggtgctg ctgttggtat 

ggaattctgg gcaagatggg ctcatagagc tctgtggcac gcttctctat 

ggaatatgca tgagtcacat cacaaaccaa gagaaggacc gtttgagcta 

aacgatgttt ttgctatagt gaacgctggt ccagcgattg gtctcctctc 

ttatggattc ttcaataaag gactcgttcc tggtctctgc tttggcgccg 

ggttaggcat aacggtgttt ggaatcgcct acatgtttgt ccacgatggt 

ctcgtgcaca agcgtttccc tgtaggtccc atcgccgacg tcccttacct 

ccgaaaggtc gccgccgctc accagctaca tcacacagac aagttcaatg 

gtgtaccata tggactgttt cttggaccca aggaattgga agaagttgga 

ggaaatgaag agttagataa ggagattagt cggagaatca aatcatacaa 

aaaggcctcg ggctccgggt cgagttcgag ttcttgactt taaacaagtt 

ttaaatccca aattcttttt ttgtcttctg tcattatgat catcttaaga 
cggtct 
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FIG. 7 

1 ccacgggt:c= gcctccccgt: ttttttccga tccgatcrcc gg^gccgagg 

51 actcagctgt t tgttC gcgc tttctcagcc gtcaccatga ccgattctaa 

101 cgatgct:gga atggatgctg .ttcagagacg actcatgttt gaagacgaat 

151 gcattctcg* tgatgaaaat aatcgtgtgg tgggacatga cactaagtat 

201 aactgtcatc tgatggaaaa gattgaagcH gagaatttac ttcacagagc 

251 rrtrcagrgtg tttttattca actccaagta tgagtAgctt ctccagcaac 

301 ggtcaaaaac aaaggttact ttcccacttg tg^ggacaaa cactt^tgc 

3 51 agccatcctc ttraccg^ga atccgagctt: attgaagaga atgrgc-^gg 

401 tgtaagaaat gccgcacaaa ggaagctttt cgatgagctc ggtattgcag 

451 cagaagatgt accagtcgat gagHtcacHc cctUgggacg catgctt-.ac 

501 aaggcacctt ctgatgggaa atggggagag cacgaagttg actatctact 

551 c^tcatcgrtg cgggatgxga agcttcaacc aaacccagat gaag^ggcrg 

601 agatcaagta cg^gagcagg gaagagctta aggagrtggt gaagaaagca 

65 1 gatgcuggcg atgaagctgn gaaactatct ccatggttca gattggrgg-c 

701 ggataatttc ttgatgaagu ggUgggatca tg^tgagaaa ggaactatca 

7S1 ctgaagctgc agacatgaaa accattcaca agctctgaac tttccataag 

801 ttttggatct tccccttccc ataataaaat taagagatga gacttictart: 

351 gattacagac aaaactggca acaaaatcta ttcctaggat ttttctttgc 

901 tttttattta ettttgitte atctctagtt tagttttcat cttaaaaaaa 

951 aaaa 
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FIG. 8 

x tg « tcttct ttatttaatc t=ccattgac 

ttcc5wtc ttsttcttct tte 

ox ~ at^gaccga CAAAGTTAcc cAAmx 

n;::r cacacaiact — — - 

CGTCTCATGT TTGAGGATGA AXGCArxCTT CTTGAICAAA . 

c!T MCAT GTCASCAACT ATAATT=TCA TcrcA ™" * 

CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTGGAAG 

"1 TATGAGTTGC rTCTCCAGCA AAGGTCAAAC ACAAAGGTTA CGTTCCCTCT 

«•! AGTGTCGACT AACACTTCTT GCAGCCATCC TCTTTACCC7 GAATCAGAGC 

"1 TTATCCAGGA CAATGCACTA GGTGTGAGGA AXGCTGCACA AAGAAAGCTT 

01 CTGGATGACC TTGGTATTGT AGCTGAAGAT GTACCAGTCG ATGAGTTCAC 

»« TGCCTTGGGA GGTATGCTGT AGAAGGCXGC TTCTGATGGG AAATGGGGAG 

"1 AGCATGAACT TGATTACTTG CTCTTCATCG TGCGAGACGT GAAGGTTCAA 

«1 CCAAACCCAG ATGAAGTAGC TGAGATCAAG TATGTGAGCC GGGAAGAGCT 

'01 GAAGGAGCTG GTGAAGAAAG CAGAXGCAGG TGAGGAAGGT TTGAAACTGT 

"1 CACCAIGGTT GAGATTGGTG GTGGACAATT TCTTGAIGAA GTGGTGGGAT 

301 CATGTTGAGA AAGGAACTTT GGTTGAAGCT ATAGACATGA AAACCATCCA 

851 CAAACTCTGA ACATCTTTTT TTAAAGTTTT TAAAICAATC AACTTTCTCT 

"I TCATCATTTT TATCTTTTCG ATCATAATAA TTTGGGATAT GTGAGAGACT 

«1 TACAAAACTT CCAAGCACCT CAGGCAATAA IAAAGTTTGC GGCCGC 
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FIG. 9 
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CGTGAACTCC 


GCCCAGCAGC 


CCAGCTGTGC 
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GGCCCTTCTC 
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TTTGACGATC 
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GCTGCTGCAA 


CAGCGTGCAC 
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GCTCAAAAAT 


CACCTTCCCA 


AGTGTGTGGA 


CGAACACCTG 


CTGCAGCCAC 
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CCTTTACATG 


GGCAGACCCC 


AGATGAGGTG 


GACCAACTAA 
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CGACGGAACA 


GTACCTGGCG 


CAAAGGCTGC 


TGCCATCCCC 


AAGTTGGAGC 
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ACGAGCTGGG 


GATACCAGCG 
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CGGCXkGCGC 


GTTTCGC7TC 
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AGCGTGAAAG 


CAGAAGCTGC 
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TCTCAGACTG 


AACCTGCAGT 
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AAGGTCAGGT 
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AAAAAAAAAA 


AAAAA 
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FIG. 10 
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FIG. I I A 

1 

HP04 MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF 
HP05 MLRSLLRGLT HIPRVNSAQQ PSCAHARLQF 
ATDP7 MSVSSLFNLP .LIRLRSLA. LSSSFSSFRF 
C.brew. MS.SSMLNFT .ASRIVSLPL LSSPPSRVHL 

AT0P5 TGPPPRFFP IRSPVPRTQL 

S.cerev. . .MTADNNSM PHGAVSSYAK LVQNQTPEDI 

51 

AEDRTDHMRG ASTWAGGQSQ DELMLKDECI 
SEDRTDHMRG ASTWAGGQSQ. DELMLKDECI 
S..GTA.MTD TKDAGMDAVQ RRLMFEDECI 
SSQATT.MGE VVDAGMDAVQ RRLMFEDECI 

T.MTD SNDAGMDAVQ RRLMFEDECI 

SSETSNDESG ETCFSGHDEE QIKLMNENCI 

101 

LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA 
LPHQPAGLLH RAFSVFLFDD QGRLLLQQRA 
ENIEAKNLLH RAFSVFLFNS KYELLLQQRS 
ENIESENLLH RAFSVFLFNS KYELLLQQRS 
EKIEAENLLH RAFSVFLFNS KYELLLQQRS 
ENIE.KGLLH RAFSVFIFNE QGELLLQQRA 

151 

GQTPDEVDQL SQVADGTVPG AKAAAIRKLE 
GQTPDEVDQL SQVADGTVPG AKAAAIRKLE 

RE SELIQDNALG VRNAAQRKLL 

RE SELIDENCLG VRNAAQRKLL 

RE SELI.EENVLG VRNAAQRKLF 

ID...DELGL KGKLDDKIKG AITAAVRKLD 

201 

RLHYCAADVQ PAATQSALWG EHEMDYILFI 
RLHYCAADVQ PAATQSALWG EHEMDYILFI 

RMLY KAPSDGKWG EHELDYLLFI 

RILY KAPSDGKWG EHELDYLLFI 

RMLY KAPSDGKWG EHEVDYLLFI 

RIHY MAPSNEPWG EHEIDYILFY 



50 

KLRSMQMTLM QPSISANLSR 

KLRSMQLL. . 

AHRPLSSIS. PRKLPNFRAF 
PLCFFSPISL TQRFSAKLTF 

FVRAFSAV 

LEEFPEIIPL QQRPN...TR 



LVDVEDNITG 
LVDVEDNITG 
LVDETDRVVG 
LVDENDKVVG 
LVDENNRVVG 
VLDWDDNAIG 
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NTKVTFPLVW 
ATKVTFPLVW 
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HELGIPAHQL 
HELGIPAHQL 
DELGIVAEDV 
DELGI PAEDL 
DELGIVAEDV 
HELGIPEDET 
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HASKLECHKF 
HASKLECHKF 
HVSKYNCHLM 
HESKYNCHLM 
HDTKYNCHLM 
AGTKKVCHLM 

150 

TNTCCSHPLH 
TNTCCSHPLH 
TNTCCSHPLY 
TNTCCSHPLY 
TNTCCSHPLY 
TNTCCSHPLC 

200 

PA.SAFRFLT 
PA.SAFRFLT 
PV.DEFTPLG 
PV.DQFIPLS 
PV.DEFTPLG 
KTRGKFHFLN 



250 

..RANVTL APNPDEVDEV 
. . RANVTL APNPDEVDEV 
..VRDVKV QPNPDEVAEI 
..IRDVNL DPNPDEVAEV 
. . VRDVKL QPNPDEVAEI 
KINAKENLTV NPNVNEVRDF 
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FIG. I I B 

251 



300 



RYVTQEELRQ MMQ PDN GLQWSPWFRI IAARFLERWW ADLDAAI NTn 

RYVTQEELRQ fWQ....PDN GLQWSPWFRI IAARFLERWW ADLDAALNTD 
KYVSREELKE LVKKADAGEE GLKLSPWFRL VVDNFLMKWW DHVEKGTLVE 
KYMNRDDLKE LLRKADAEEE GVKLSPWFRL WDNFLf SSJ DHVEKGS m 
ICYVSREELKE LVKKADAGDE AVKLSPWFRL WNFWKWW DHVEKGTITE 
KWVSPNDLKT MF ADP SYKFTPWFKI ICENYLFNWW EQLDDLSEVE 



301 

KHEDWGTVHH INEA* 
KHEDWGTVHH INEA* 
A.IDMKTIHK L* 
A.ADMKTIHK L* 
A.ADMKTIHK L* 
A.ADMKTIHK L* 
NDRQ...IHR ML* 
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FIG. 12 
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F I G. I4A 

Adonis palaestino £ -cyclase cDNA #5 Length: 1898 

51 SawctcSt mSfSt? t ttac + tgtcgc ^tcttgcaa cacttatatt 
im I S tttcttcttt tctcttcaaa acaacaaact aatgtgagca 

15 ct ?^? aact acttggtgtt cgcaacctca tctcttcttg 

9m ccctgtgtgg acttttggaa caagaaacct tagtagttca aaactagctt 
25 n^nff a £ a tcgatat 9 gt tcttcttgta gagtagattt tcaagtgaga 
™ S gatggtg g aa gcgggag tagaagttct gttgcttata aagagggttt 
35 ^aggatttta tcaaagctgg tggttctgag cttttgtttg 

dm Hn?f 9Ca gcaaacaaa 9 tctatggaga aacaggccaa gctcgccgat 
45 fnS^ Cac caataccttt tggagaatcc gtgatggact tggttgtaat 
5m nSJ tgt "? 5 ct 9Ctggtc tttcactggc tgcagaagct gctaagctag 
551 SS? a f agt tggccttatt ggtcctgatc ttccttttac aaataattat 
ggtgtgtggg aagacgagtt caaagatctt ggacttgaac gttgtatcqa 
am g + ca H ct 599 aaggacacca tcgtatatct tgataatgat gctcctgtcc 
70 S 9 tg catatgga cgagttagtc gacatttgct acatgaggag 
701 ttgctgaaaa ggtgtgtgga gtcaggtgta tcatatctgg attctaaaqt 
751 ggaaaggatc actgaagctg gtgatggcca tagccttgtl gtttg?gala 
801 atgagatctt tatcccttgc aggcttgcta ctgttgcatc tggagcagct 
90 aar^t d t aa t C ttttggagta tgaagtaggt ggccctcgtg tttgtgtcca 
951 gggfggagg ttgaggtgga gaacaatcca tacgatccca 

inm tnll^ 99t attca Jgg ac tacagagact atatgcaaca gaaattacag 
1001 tgctcggaag aagaatatcc aacatttctC tatgtcatgc ccatgtcgcc 
a !i? ag \ c " ttttttgagg aaacctgttt ggcctcaaaa gatglcatgc 
liS Sii Cga i? act g aa 9 a 9 a aaactgatgt cacgattgaa gactctgggt 
120 taattrflll Caaaagttta tgaagaggaa tggtcatata ttcctgttgg 
1251 ccaaacacag agcaaaagaa cctagcattt ggtgctgcag 

1251 caagcatggt gcatccagca acaggctatt cggttgtacg gtcactqtca 
}%} gaa ? ctc J aa aata tgcttc tgtaattgca aigattttga igcaagltaa 
1401 aaarS £3*2? 9 gacaaagta 9 tgcagtaaac atttcaatgc 
1451 ???2£??? 9 caq ^ cttt 99 fcaaaggagc gaaaacgtca aagagcatTc 
1501 JESSE? gattagagc t tattgtgcag ctagatattg aagcaaccag 
1S1 KSi^J* agaaccttct tccgcttgcc aacttggatg tggtggggtt 
lfiOl ffi t ttcactatca tctttcgatc tcgtcttgtt ttccatgtac 
jccj a ^ tgtt J tggcgccaaa cagcatgagg atgtcacttg tgagacattt 
1701 S2S2? a J c ^ ggtg "gttatggt aagagctta? ctcgaaaggt 
17?! a ?^ tCat ? a " a ! taaac tctagtgttt caccaaataa atgaggatcc 
}lm "cgaatgtg tatatgatca tctctatgta tatcctgtac tctaatctca 

85 ffi tttg ata W gtcaaaccgg sss 

i»i aaagtaaatt tattgataca aaagtagttt ttttccttaa aaaaaaaa 
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FIG. I4B 



Adonis palaestina £ -cyclase #5 predicted polypeptide 
TRANSLATE from: 113 to: 1702 Length: 529 amino acids 

1 MELLGVRNLI SSCPWTFGT RNLSSSKLAY NIHRYGSSCR VDFQVRADGG 

51 SGSRSSVAYK EGFVDEEDFI KAGGSELLFV QMQQTKSMEK QAKLADKLPP 

101 I PFGESVMDL VVIGCGPAGL SLAAEAAKLG LKVGLIGPDL PFTNNYGVWE 

151 DEFKDLGLER CIEHAWKDTI VYLDNDAPVL IGRAYGRVSR HLLHEELLKR 

201 CVESGVSYLD SKVERITEAG DGHSLVVCEN EIFIPCRLAT VASGAA5GKL 

251 LEYEVGGPRV CVQTAYGVEV EVENNPYDPN LMVFMDYRDY MQQKLQCSEE 

301 EYPTFLYVMP MSPTRLFFEE TCLASKDAMP FDLLKRKLMS RLKTLGIQVT 

351 KVYEEEWSYI PVGGSLPNTE QKNLAFGAAA SMVHPATGYS VVRSLSEAPK 

401 YASVIAKILK QDNSAYVVSG QSSAVNISMQ AWSSLWPKER KRQRAFFLFG 

451 LELIVQLDIE ATRTFFRTFF RLPTWMWWGF LGSSLSSFDL VLFSMYMFVL 

501 APNSMRMSLV RHLLSDPSGA VMVRAYLER* 
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FIG. I5A 

DMA sequence of potato cDNA (GenBank R27545) obtained frcm Nicholas J. 
Provart 

potato.seq Length: 1378 August 2, 1996 13:06 Type: N Check: 605 
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aacatgtttg 
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1 CI 
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gttactcaaa 
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301 


ctcggggaaa 


351 


aaacagctta 


401 


agcctgatgg 


/I FT 1 

451 


atctttagaa 
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caacacgagt 


rri 

551 


ccattcgatc 


601 


tgtaagaatt 


DDI 


gaggatcttt 


701 


gctagcatgg 


751 


tgaagctcca 


801 


atagcaagaa 


851 


gcttggaaca 


901 


cctatttgga 


951 


catttttccg 


1001 


cttggttcaa 


1051 


gtttattatt 


1101 


tatctgatcc 


1151 


agtaaattcc 


1201 


attcataaca 


1251 


ctaggttaat 
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cttctttttt 


1351 


gtatatataa 



poteps.pep Length: 378 TRANSLATE from: 14 to: 1147 

DEFKDLGLQA CIEHVWRDTI VYLDDDDPIL IGRAYGRVSR HLLHEELLKR 
51 CVEAGVLYLN SKVDRIVEAT NGHSLVECEG DVVIPCRFVT VASGAASGKF 
101 LQYELGGPRV SVQTAYGVEV EVDNNPFOPS LMVFMDYRDY VRHDAQSLEA 
151 KYPTFLYAMP MSPTRVFFEE TCLASKDAMP FDLLKKKLML RLNTLGVRIK 
201 EIYEEEWSYI PYGGSLPNTE QKTLAFGAAA SMVHPATGYS WRSLSEAPK 
251 CAFVLANILR QNHSKNMLTS SSTPSISTQA WNTLWPQERK RQRSFFLFGL 
301 ALILQLDIEG IRSFFRAFFR VPKWMWQGFL GSSLSXADLM LFAFYMFIIA 
351 PNDMRRGLIR HLLSDPTGAT LIRTYLTF* 
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FIG. I5B 

Chimeric lettuce/potato lycopene e-cyclase: converts lycopene to 5- 
carotene, the lettuce cDNA converts lycopene to £ -carotene and the 
potato cDNA does not produce an active enzyne 

(amino acids in lower case are from lettuce and those in uppercase 
are from the potato cDNA; an Avail site in ccmron to the two cDNAs 
was used to construct the chimera) 

1 mecfgarnmt atmavftcpt ftdcnirhkf sllkqrrftn lsassslrqi 

51 kcsaksdrcv vdkqgisvac eedyvkaggs elffvqmqrt ksmesqskls 

101 eklaqipign cildlvvigc gpaglalaae saklglnvgl igpdlpftnn 

151 ygvwqdefig lglegciehs wkdtlvyldd adpirigray grvhrdllhe 

201 ellrrcvesg vsylsskver iteapngysl iecegnitip crlatvasga 

251 asgkfleyel gGPRVSVQTA YGVEVEVDNN PFDPSLMVFM DYRDYVRHDA 

301 QSLEAKYPTF LYAMPMSPTR VFFEETCLAS KDAMPFDLLK KKLMLRLNTL 

351 GVRIKEIYEE EWSYIPVGGS LPNTEQKTLA FGAAASMVHP ATGYSVVRSL 

401 SEAPKCAFVL ANILRQNHSK NMLTSSSTPS ISTQAWNTLW PQERKRQRSF 

451 FLFGLALILQ LDIEGIRSFF RAFFRVPKWM WQGFLGSSLS XADLMLFAFY 

501 MFIIAPNDMR RGLIRHLLSD PTGATLIRTY LTF* 
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FIG. 16 

GAP corparison of Arabidopsis £ -cyclase x potato £ -cyclase (partial) 
blosum62.cnp Gap Weight: 12 Average Match: 2.912 
Length Weight: 4 Average Mismatch: -2.003 
Quality: 1485 Length: 529 

Ratio: 3.929 Gaps: 1 

Percent Similarity: 79.893 Percent Identity: 76.139 
Match display thresholds for the alignment(s) : 
' = IDENTITY : = 2 . = 1 



151 EDEFNDLGLQKCIEHVWRETIVYLDDDKPITIGRAYGRVSRRLLHEELLR 200 

III Mill IMMIhllMMII II IMIIIIMI lllllll: 

1 .DEFKDLGLQACIEHVWRDTIVYLDDDDPILIGRAYGRVSRHLLHEELLK 49 
201 RCVESGVSYLSSKVDSITEASDGLRLVACDDNNVI PCRLATVASGAASGK 250 

MINI II. MM I II-- 1 II h Mill 1 1 1 1 1 1 1 1 1 1 

50 RCVEAGV LY LNSKVDRI V EATNGHS LVEC EGDVV I PCRFVTVASGAASGK 99 

251 LLQYEVGGPRVCVQTAYGVEVEVENSPYDPDQMVFMDYRDYTNEKVRSLE 300 

MM. Mill IMIMIMIhUMI f II 1 1 1 1 1 ! MM 
100 FLQYELGGPRVSVQTAYGVEVEVDNNPFDPSLMVEMDYRDYVRHDAQSLE 149 

301 AEYPTFLYAMPMTKSRLFFEETCLASKDVMPFDLLKTKLMLRLDTLGIRI 350 
III 



150 AKYPTFLYAMPMSPTRVFFEETCLASKDAMPFDLLKKKLMLRLNTLGVRI 199 
351 LKTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSLSEAP 400 

• 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 M 1 1 II M 1 1 1 1 II 1 M 1 1 1 1 

200 KEIYEEEWSYIPVGGSLPNTEQKTLAFGAAASMVHPATGYSVVRSLSEAP 249 
401 KYASVIAEI LREETTKQI . ! . . . NSN I SRQAWDTLWPPERKRQRAFFLFG 445 

I I hi III: -I • Ml IMMMI MUM. Mill 

250 KCAFVLANILRQNHSKNMLTSSSTPSISTQAWNTLWPQERKRQRSFFLFG 299 
446 U\LIVQR)TEGIRSFFRTFFRLPKWMW(^FLGSTLTSGDLVLFALYMFvi 495 

MUM I IMMMI IMMIIIMMIMM. IIMM IMM 

300 LALI LQLDI EGI RSFFRAFFRVPKWMWQGFLGSSLSXADLMLFAFYMFI I 349 

496 S PNN LRKGL I NH LI SDPTGATMI KTY LKV 524 

- 1 1 - : 1 : 1 1 ! IIMIMIIhhlll 
350 APNDMRRGLI RHLLSDPTGATLI RTYLTF 378 
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FIG. I7A 



Adonis palaestina Ipil 

1 attcatcttc agcagcgctg tcgtactctt tctatatctt cttccatcac 

51 taacagtagt cgccgacggt tgaatcggct attcgcctca acgtcaacta 

101 tgggtgaagt cactgatgct ggaatggatg ctgttcagaa gcggctcatg 

151 ttcgacgacg aatgtatttt ggtggatgag aatgacaagg tcgtcgggca 

201 tgattccaaa tacaactgtc atttgatgga aaagatagag gcagaaaatt 

251 tgcttcacag agccttcagt gttttcttgt tcaactcaaa atatgaattg 

301 cttcttcagc aacgatccgc cacaaaggta acattcccgc tcgtatggac 

351 aaacacatgt tgcagtcatc ctctctttcg tgattccgag ctcatagaag 

401 aaaattatct cggtgtacga aacgctgcac aaagaaagct tttagacgag 

451 ctaggcattc cagctgaaga tgtcccagtt gatgaattta ctcctcttgg 

501 tcgcattctt tacaaagctc catctgacgg caaatgggga gagcacgaat 

551 tggactatct cctatttatt gtccgagatg tgaaatacga tccaaaccca 

601 gatgaagttg ctgatgctaa gtatgttaat cgcgaggagt tgagagagat 

651 actgagaaaa gctgatgctg gtgaagaggg actcaagttg tctccttggt 

701 ttagattggt tgttgataac tttttgttca agtggtggga tcatgtagag 

751 cagggtacga ttaaggaagt tgctgacatg aaaactatcc acaagttgac 

801 ttaagaggac ttctctcctc tgttctacta tttgtttttt gctacaataa 

851 gtgggtggtg ataagcagtt tttctgtttt ctttaattta tggcttttga 

901 atttgcctcg atgttgaact tgtaacatat ttagacaaat atgagacctt 

951 gtaagttgaa tttgaggctg aatttatatt tttgggaaca taataatgtt 

1001 aa 
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FIG. I7B 



Adonis palaestina Ipi2 

1 ttttaaagct ctttcgctcc accaccatca aagccagcca aatttctctg 

51 tacaaaagtt aaaaacaccg ctttgggctt tggcccctcc atatcggaat 

101 ccttgtttac gatacgcatc taaaccagta attctcggtt ttaatttgtt 

151 tcctaaatta ggcccctttc cggaatcccg agaattatgt cgtcgatcag 

201 gattaatcct ttatatagta tcttctccac caccactaaa acattatcag 

251 cttcgtgttc ttctcccgct gttcatcttc agcagcgttg tcgtactctt 

301 tctatttctt cttccatcac taacagtcct cgccgagggt tgaatcggct 

351 gttcgcctca acgtcgacta tgggtgaagt cgctgatgct ggtatggatg 

401 ccgtccagaa gcggcttatg ttcgacgatg aatgtatttt ggtggatgag 

451 aatgacaagg tcgtcggaca tgattccaaa tacaactgtc atttgatgga 

501 aaagatagag gcagaaaact tgcttcacag agccttcagt gttttcttat 

551 tcaactcaaa atacgagttg cttcttcagc aacgatctgc aacgaaggta 

601 acattcccgc tcgtatggac aaacacctgt tgcagccatc ccctcttccg 

651 tgattccgaa ctcatagaag aaaattttct cggggtacga aacgctgcac 

701 aaaggaagct tttagacgag ctaggcattc cagctgaaga cgtaccagtt 

751 gatgaattca ctcctcttgg tcgcattctt tacaaagctc catctgacgg 

801 aaaatgggga gagcacgaac tggactatct tctgtttatt gtccgagatg 

851 tgaaatacga tccaaaccca gatgaagttg ctgacgctaa gtacgttaat 

901 cgcgaggagt tgaaagagat actgagaaaa gctgatgcag gtgaagaggg 

951 aataaagttg tctccttggt ttagattggt tgtggataac tttttgttca 

1001 agtggtggga tcatgtagag gaggggaaga ttaaggacgt cgccgacatg 

1051 aaaactatcc acaagttgac ttaagagaaa gtctcttaag ttctactatt 

1101 tggtttttgc ttcaataagt ggatggtgat gagcagtttt tatgcttcct 

1151 ttaattttgg cttttcaatt tgctttatgt gttgaacttg taacatattt 

1201 agtcaaatat gagaccttgt gagttgaatt tgaggttata tttatagttt 

1251 tgggaacata aaaaaaaaaa 
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FIG. I8A 

"TTtfg^ 9cggcagtcc 
51 gatgccgcga tgcttcgttc gttgctcaga ggcctcacgc atatcccccg 
101 cgtgaactcc gcccagcagc ccagctgtgc acacgcgcga ctccagttta 
151 agctcaggag catgcagatg acgctcatgc agcccagcat ctcagccaat 
201 ctqtcqcgcg ccgaggaccg cacagaccac atgaggggtg caagcacctg 
251 ggcaggcggg cagtcgcagg atgagctgat gctgaaggac gagtgcatct 
301 tggtggatgt tgaggacaac atcacaggcc atgccagcaa gctggagtgt 
351 cacaagttcc taccacatca gcctgcaggc ctgctgcacc gggccttctc 
401 tgtgttcctg tttgacgatc aggggcgact gctgctgcaa cagcgtgcac 
451 gctEaaaaat caccttccca agtgtgtgga cgaacacctg ctgcagccac 
501 cctttacatg ggcagacccc agatgaggtg gaccaactaa gccaggtggc 
551 cgacggaaca gtacctggcg caaaggctgc tgccatccgc aagttggagc 
601 acgagctggg gataccagcg caccagctgc cggcaagcgc gtttcgcttc 
651 ctcacgcgtt tgcactactg tgccgcggac gtgcagccag ctgcgacaca 
701 atcagcgctc tggggcgagc acgaaatgga ctacatcttg ttcatccggg 
751 ccaacgtcac cttggcgccc aaccctgacg aggtggacga agtcaggtac 
801 qtqacgcaag aggagctgcg gcagatgatg cagccggaca acgggctgca 
851 ltggt?gccg tggtttcgca tcatcgccgc gcgcttcctt gagcgttggt 
901 gggEtgacct ggacgcggcc ctaaacactg acaaacacga ggattgggga 
951 acggtgcatc acatcaacga agcgtgaaag cagaagctgc aggatgtgaa 
inni aacacatcat qgggtggaat tgcgtacttg gcagcttcgt atctcctttt 
1051 tctgagaSg SStgcagt caggtcccac aaggtcaggt aaaatggctc 
1101 gatlliatgt accgtcactt tttgtcgcgt atactgaact ccaagaggtc 
1151 aaaaaaaaaa aaaaa 
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FIG. I8B 

Haematococcus pluviolis Ipi2 
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FIG. I9A 

Lactuca sativa Ipil 

1 tgccaaaatg ttgaaatttc ccccttttaa aaccattgct accatgatct 

51 cttctccata ttcttccttc ttgctgcctc ggaaatcttc tttccctcca 

101 atgccgtctc tcgcagccgc tagtgttttc ctccaccctc tttcgtctgc 

151 cgctatgggc gattccagca tgqatgctqt ccagcgacgt ctcatgttcq 

201 atgacgaatg cattttggtg gatgagaatg acaaagtggt tggccatgat 

251 actaaataca attgtcattt gatggagaag attgaaaagg gaaatatqct 

301 acacagagca ttcagtgtgt tcttgttcaa ctcgaaatat gaattactcc 

351 ttcagcaacg ttctgcaacc aaggtgactt tccctttggt atggacaaac 

401 acgtgttgca gccatccact atacagggag agtgagctta ttgacgaaaa 

451 cgcccttggg gtgaggaatq ctgcacagag gaaqctcctg gatgaactcg 

501 gcatccctgg agcagatgtt ccgqttgatg agttcactcc attgggtcgc 

551 attctataca aggccgcatc gqatqgaaag tqqggagaac atgaacttqa 

601 ttacctgctg tttatggtac gtgatgttgg tttggatccg aacccagatg 

651 aagtgaaaga tgtaaaatat gtgaaccgqg aagagctgaa ggaattqgta 

701 aggaaggcgg atgctggtga agaqqgtgtg aagctgtccc cqtggttcaa 

751 attgattgtc gataatttct tgtttcagtg gtgqgatcga ctccataagg 

801 gaaccctaac cgaaqctatt gatatgaaaa caatccacaa actcacataa 

851 aaacactaca ctagtaqgag agaggattat atqaqatatt tgttatatgt 

901 gaaattgaaa ttcagatgaa tgcttgtatt tatttctatt tggacaaact 

951 tcaacttctt tttgctacct tatcagaaaa aaaaa 

FIG. I9B 

Lactuca sativa Ipi2 

1 tattcgcttc aaaatctctt ccattaactg ctcaaatctc caccttcgcc 

51 ggtcttaatc tccgccggcg cactttcacc accataaccg ccgccatgqg 

101 tgacgattcc ggcatggacg ctgtccagag acgtctcatg tttgatgatg 

151 aatgcatttt ggttgatgaa aatgacaatg ttcttgggca tgataccaaa 

201 tacaattgtc acttgatgga gaagattgag aaagataatt tqcttcatag 

251 agcattcagt gtatttttat tcaattcaaa atacgaatta ctccttcagc 

301 aaaggtcaga aaccaaggtg acatttcctt tggtatggac aaacacctgt 

351 tgcagccatc cactatacag agaatcggag ttaattcccg aaaatqccct 

401 tggggtcaga aatgctgcac agaggaagct tctaqatgaa ctcggtatcc 

451 ctgctgaaga tgttccaqtt gatgagttca caactttagg tcgcatgttg 

501 tacaaggctc catctgatgg aaaatggggt gaacatgaag ttgattacct 

551 actcttcctc gtgcgtgacg ttgccgtgaa cccaaaccct gatgaggtgg 

601 cggacattag atacgtgaac caagaagagt taaaagagtt actaaggaag 

651 gcqgatgcgg gtgaggaggg tttgaaattg tccccatggt ttaggctagt 

701 ggtggacaac ttcttgttca aatggtggga tcatgtccaa aaggggacac 

751 tcaatgaagc aattgacatg aaaaccattc ataaqttgat atgaaaaatg 

801 gttaatattt atqqtggtgg tttqgaqcta ataatttgtq tgttcaagtc 

851 tcggtccttc tttttttaac gttttttttt tttcttttat tgggagtgtt 

901 tattgtgtac ttgtaacgta ggccctttgg ttacgcttta agagtttaat 

951 aaagaaccac cgttaattta aaaaaaaaaa aaaaaaaa 
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FIG. 2 0 

Chlamydcmonas reinhardtii Ipil 

(Note: the isomerase cDNA probably ends at ca. base 1103; the 
second half of the cDNA is similar to extensin and other 
hydroxyproline-rich structural proteins) 

1 ggcacgagct cgagtttgtt ttaccatgac atcgggaatt tggaagcttg 
51 aactacctca attactcaag taactcgcgg caacacattt cgcgcgccat 
101 cgctgttttc tctgctccaq ctaccgagca gcattgcttt agatcgcttt 
151 gatgtcataa actcccactt atatgagatc cagtttcatc gagcccaagc 
201 ccagagcgca acctgtctta agccgcggca qggcgtccat gcgcctcgcg 
251 caaagccgtg ctctcgttgc gcgtgtcagc tccgccctgt ggccgggagc 
301 aggactttca caggctcaaa gcgttgcggt gcgaatqgcg agttcgtcaa 
351 cctgggaagg cacgggcctg agccaggatg acttcatgca gcgggacgag 
401 tgcttggtgg tggacgagca ggaccggctg ctaggcaccg ccaacaaqta 
451 cgactgccac cgcttcgaqg cqgccaaggg ccagccctgc ggccgcctqc 
501 accgcgcctt ctccgtgttc ctqttcaqcc ccgacggccq actgctgctg 
551 cagcagcgcg cagccagcaa ggtgacgttc ccgggtgtgt gqaccaacac 
601 ctgctgctcg cacccgctgg cgggccaggc gccqgacgag gtggacctqc 
651 cggcggcggt agcctcgggc caggtqccgg gcatcaaggc qgcggcggtg 
701 cgcaagctgc agcacgagct gqgqataccg ccggagcagg ttcccgcctc 
751 ctccttctcc ttcctcacgc gtctgcacta ctgcgccgcc gacaccgcca 
801 cgcacggccc ggcggcggag tgqggcgagc acgaggtgga ctacgtgctq 
851 ttcgtgcggc cgcagcagcc cgtcagcctg caqcccaacc cagacgaggt 
901 ggacgccacg cgctacgtga cgctgccqga gcttcaqtcc atgatggcgg 
951 accccggcct cagctggagc ccctggttcc gcatcctggc cacacagccc 
1001 gccttcctgc ccgcctggtg gggcgacctq aagcggcgct ggcgcccggg 
1051 cggcagccga ctgtaggact ggggcaccat ccaccgcgtc atgtgaagaa 
1101 aaaggggaag cagqggcgqg agcqgqggat gaatgggaat gtgaatgcga 
1151 ttgtgatgcg gcgtgggatg agqtctgaag acagggggaa aatcgggggg 
1201 cgggcgtgag cgtgtgtgta cgtgagcqac aaagccggga ggcqqaccgc 
1251 gcgatgggta catqtqtqtg cggagggtcg gtgggtcggt cggttgcgcg 
1301 gcatagcgtg ttgtgtgtgt gcggctgcgc gggtatgtqg gcacccgggc 
1351 acggaggaga aggcacacqc aggtggcgcg gaggtgtgtc aggggccatg 
1401 ggcgggcctc actcctggtc gtgcccagtg gtctcgtggg cagaqtggca 
1451 ggggctgcac ccatatgagc ggcgcactgc cgcgctgggc taagtcctta 
1501 tcacttggtg aggtgqggcg aggtggctgt gggcggcggg cgcagtggca 
1551 gaaggacacg gtqtgtgagc ggtggagctc tggccgtgcc ggccgtgagg 
1601 ggcggatagc gatatgacgt tgtgcttggc cgctgtaatg cgggagaatg 
1651 tgcaggccgc gagaagcggg cggtggcagg aggccgcagg ctgcagcacc 
1701 cgttggggag gtgccgcctg caggcgcggc gccgggcggg cctgagtaat 
1751 gggcgcctga gtagtggcgg ccacaggagg cgcaggaggc agcagcagga 
1801 ggacgagctg gagggacccg ttggcaaccc aaggttgcgc gtgtaacata 
1851 gtggccatac aaaaaaaaaa aaaa 
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F I G. 2 I A 

Tagetes erecta Ipil 

1 ccaaaaacaa ctcaaatctc ctccgtcgct cttactccqc catgggtgac 

51 gactccggca tggatgctgt tcagcgacgt ctcatgtttq acgatgaatg 

101 cattttggtq gatgagtgtg acaatgtggt gqgacatgat accaaataca 

151 attgtcactt gatggagaag attgaaacag gtaaaatgct gcacagagca 

201 ttcagcgttt ttctattcaa ttcaaaatac gagttacttc ttcagcaacg 

251 gtctgcaacc aaggtgacat ttcctttagt atggaccaac acctgttgca 

301 gccatccact ctacagagaa tccgagcttg ttcccgaaaa cqcccttqga 

351 gtaagaaatg ctgcacagag gaaqctgttg gatgaactcg gtatccctgc 

401 tgaagatgtt cccqttgatc agtttactcc tttaggtcgc atgctctaca 

451 aggctccatc tgatggaaag tgqggagaac atgaacttqa ctacctactt 

501 ttcatagtga gagacgttgc tgtaaacccg aacccagatg aagtggcgga 

551 tatcaaatat gtganccang aagagttaaa ggagctqcta aqqaaagcaq 

601 atqcggqgga qgaqggtttg aagctgtctc catggttcag gttagtqgtt 

651 qataacttct tgttcaagtg gtgqgatcat gtqcaaaagg gtacactcac 

701 tgaagcaatt gatatgaaaa ccatacacaa gctgatatag aaacacaccc 

751 tcaaccgaaa aqttcaagcc taataattcg qgttqgqtcg ggtctaccat 

801 caattgtttt tttcttttaa gaaqttttaa tctctatttg aqcatgttga 

851 ttcttgtctt ttgtqtgtaa qattttgqqt ttcqtttcaq ttgtaataat 

901 gaaccattga tggtttgcaa tttcaagttc ctatcgacat gtagtgatct 

951 aaaaaa 

FIG. 2 I B 

Oryza sative Ipil 

1 cctccctttg cctcgcgcag aggcggccgc gccttctccg ccgcqaggat 

51 ggccggcgcc gccgccgccg tggaggacqc cggqatggac gaggtccaga 

101 agcggctcat gttcgacgac gaatgcattt tgqtggatga acaagacaat 

151 gttgttggcc atgaatcaaa atataactgc catctqatqg aaaaaatcga 

201 atctgaaaat ctacttcata gggctttcag tgtattcctq ttcaactcaa 

251 aatatgaact cctactccag caacgatctq caacaaaggt tacatttcct 

301 ctagtttgga ccaacacttg ctgcagccat cctctgtacc gtgagtctga 

351 gcttatacag gaaaactacc ttggtqttag aaatgctgct cagaggaaqc 

401 tcttggatga gctgggcatc ccagctgaag atgtgccaqt tgaccaattc 

451 acccctcttg gtcggatgct ttacaaqgcc ccatctgatg gaaaatgggg 

501 tgaacacgag cttgactacc tgctgttcat cgtccgcgac gtgaaggtag 

551 tcccgaaccc ggacgaagtg gccgatgtga aatacgtgag ccgtgagcaq 

601 ctgaaggagc tcatccgcaa aqcqgacgcc gqagaqgaag gcctgaagct 

651 gtctccctgg ttccggctgg ttgttgacaa cttcctcatg ggctggtgqg 

701 atcacgtcga gaaaggcacc ctcaacgagq ccgtggacat ggagaccatc 

751 cacaagctga agtaaggact gcgatgttgt ggctggaaag aatgatcctg 

801 aagactctgt tcttgtgctg ctgcatatta ctcttaccag qqaagttqca 

851 gaagtcagaa gaagcttttg tatgtttctg ggtttggaqc ttqqaaqtqt 

901 tgggctctgc tgactgagag attcccttat agagtgtcta tgttaattta 

951 gcaaacttct atattataca tqattaqtta attgttcggt gtctgaataa 

1001 agaacaatag catgttccat gtttatttgc t 
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Comparison using GAP program of the Genetics Computer Group 



Gap Weight: 50 

Length Weight: 3 

Quality: 17392 

Ratio: 9.411 

Percent Similarity: 95.331 



Average match 
Average Mismatch 
Length 
Gaps 

Percent Identity 



Match display thresholds for the al ignment(s) : 
| = IDENTITY : = 5 . = 1 



10.000 
0.000 
1904 
3 

95.331 



Adonis palaestina e -cyclase, #3 x Adonis paloestina £ -cyclase #5 
1 gagagaaaaagaqtgttatattaatqttactgtcgcattcttgcaacac! 49 

in iiijiii Jill 1 1 JTj 1 1 ijT i it i JJi J JiT r 1 1 f 1 1 

1 aaaggagtgttctattaatgttactgtcgcattcttgcaacact 44 

• • • • • 

50 .atattcagactccattttcttgttttctcttcaaaacaacaaactaatq 98 

.,1111111 lllllllllllll Jill II III I II II II 1 1 II I Mill? 

45 tatattcaaactccattttcttcttttctcttcaaaacaacaaactaatg 94 
99 tqa.cgqaqtatctaqctatqgaactacttqgtqttcgcaacctcatctc 147 

ill I TIT) I I Till J T T 1 1 1 J 1 1 J J TT IT J J I T 1 1 1 1 1 1 1 1 1 1 1 1 

95 tgagcagagtatctggctatggaactacttggtgttcgcaacctcatctc 144 

• • • • • 

148 ttcttgccctgtctggacttttggaacaagaaaccttagtagttcaaaac 197 

Ijipiljij ITT 1 1 J J J JTT 1 1 11 IT 1 1 1 1 1 It ITJ 1TJ1 1M 

145 ttcttgccctgtgtggacttttggaacaagaaaccttaqtagttcaaaac 194 



• • • • • 

198 tagcttataacatacatcgatatggttcttcttgtagagtagattttcaa 247 

J ITI I 111 1 1 1 1 II I 111 T 111 iTriJ I J J I J JTI ITITJ I T I J J J J 

tagcttataacatacatcgatatggttcttcttgtagagtagattr: 



195 tagcttataacataca 
248 qtqaqgqctqatqqtqqaaqc 
245 

295 gggttttgtggatgaagagga 



?W?iW 

gtgagagctgatggtggaagcgggagtagaagttctgttgcttataaag 



caa 244 
ataaaga 297 

Mill 0QA 

aaaga 294 



ta 

i 

a 



caaagctggtgg 



tctgagcttt 347 
xtgagcttt 344 



ill 



348 tgtttgtccaaatgcagcaaacaaagtctatggagaaacaggccaagctc 397 

w 1T1J.JTI 1 1 1 1 11TI ITI 1 1 1 1 1 1 ITJ I J IfTTITI 1 1 1 ITTI 1 1 ITI J L , 

345 tgtttgtccaaatgcagcaaacaaagtctatggagaaacaggccaagctc 394 
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FIG. 23 B 



398 m m mm Hrmrnff mf mrnim 447 

gccgataaqttqccaccaataccttttggagaatra 



gccga 

aagttgccaccaataccttttgga^ 444 

445 tgtaataggttgtggacctgctggtcUtcactggctgcagaag^ 494 
498 ??ctaaacttaaM^ 547 

495 WlIilUM 544 

545 aattatggtgtgtgggaagacgagttcaaagatcttggacttgaacgttg 594 

' 9 nitfn» 

595 tatcgagcatgcttggaaggacaccatcgtatatcttgataatgatgctc 644 

2 697 

645 ctgtccttattggtcgtgcatatggacgagttagtcgacatttgctacat 694 

698 IffM^^ 747 
695 gaggagttgctgaaaaggtgtgtggagtcaggtgtatcatatctggattc 744 

748 jtrrmfTfTTOTTf^ 797 

745 taaagtggaaaggatcactgaagctggtgatggccatagccttgtagttt 794 

* • • • • 

798 qtqaaaacqacatctttatcccttqcaggcttgctactgttgcatctgqa 847 

7 « I li 1 1 'J 1 1 'III 1 ! 1 1 'P ITTIJJTII 1 1 IT) III I II ITT I 

795 gtgaaaatgagatctttatcccttgcaggcttgctactgttgcatctgga 844 

2 TT TTTTTTTTTT TTT ? TTT TTTT T?TT?TTT t*TTT T T ? T T TT T ? 897 

845 gcagcttcagggaaacttttggagtatgaagtaggtggccctcgtgtttg 894 
898 tqtccaaactqcttatqqtqtqqaqqttqaqqtqqaqaacaatccatacg 947 



895 tgtccaaaccgcttatggggtggaggttgaggtggagaacaatccatacg 944 
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F 16. 23C 



tcccaac 



995 



• • * 

ggtatttatggactacagagactatatgcaacagaaa 997 

., „..,ttiiii imiiiiinmiiiiimiiiiliii QQa 

945 atcccaacttaatggtattcatggactacagagactatatgcaacagaaa 994 

• • • • • 

998 ttacagtgctcggaagaagaatatccaacatttctctatgtcatgcccat 1047 

111 i n iTiiiTTi iti it i ij in 1 1 1 1 1 i n 1 1 1 ) i]Ti i iiTm 1 1 

:tacagtgctcggaagaagaatatccaacatttctctatgtcatgcccat 



1044 



• • • • • 

1048 gtcgccaacaagacttttttttgaggaaacctqtttqqcctcaaaaqatq 1097 

^ Tl ITI II II I III llllJlJ J ITITTI 1 1 1 IITJJJTTI II 111 I ITIlT inM 

1045 gtcgccaacaagacttttttttgaggaaacctgtttggcctcaaaagatg 1094 

• • * * * _ 

1098 ccatgcctttcgatctactgaagagaaaactaatgtcacqattqaaqact 1147 

I II IT 1 1 II IT Hill I JTI IT IT 1 1 1 1 11 I1TIIIITII TIITII) 

1095 ccatgccattcgatctactgaagagaaaactgatgtcacgattgaagact 1144 

• • • • • 

1148 ctgggtatccaagttacaaaaatttatgaagaggaatggtcttatattcc 1197 

I JTTT J 1 1 1 1 1 1 Til 111111 II 1 1 II 1 mil II ITT 1 1 II mill _ M 

1145 ctgggtatccaagttacaaaagtttatgaagaggaatggtcatatattcc 1194 

• • • • • 

1198 tgttgggggttctttaccaaacacagagcaaaagaacctaqcatttqqtq 1247 

1T11TT TTilllli I M 1 1 1 1 1 UT IT M 1 1 IT 1 1 1 II IT M lilTTiT 

1195 tgttggtggttctttaccaaacacagagcaaaagaacctagcatttggtg 1244 

1248 m c \ i ft! i 111 ffl ftiMMM'p 1297 

iqqctattcqgttgtacggt( 



ca 1294 



1245 ctgcagcaagcatggtgcatccagcaacaggc 
1298 ctatcagaaqctccaaaatatgcttctqtaattqcaaaqattttqaaqca 1347 

II II ITI IT I II 1 1 1 1 111 III II 111 111 IllIIIllllTII 

1295 ctgtcagaagctccaaaatatgcUctgtaattgcaaagatttt 1344 

• • • * 

1348 agataactctqcatatqtqqtttctqqacaaaqcaqtqcaqtaaaca^ 1397 

llllMIIIIII J I )T ITTJ TT II I III I II] ll 11)1 1 Ml]]) 
1345 agaUactctgcgtatgtggtUctggacaaagtagtgcagtaaacattt 1394 

• • * * 

1398 caatqcaagcatggagcagtctttqqccaaaqqaqcqaaaacqtcaaaqa 1447 

IIIITIIITII llllllllllillllll I ITT IT IT 1 1 1 1 1 TT 1 1 1 IT I 1AM 

1395 caatgcaagcatggagcagtcUtggccaaaggagcgaaaacg 1444 
1448 gcattctttcttttcOT^ 1497 
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1498 aaccagaacgttctttagaaccttcttccgcttgccaacttqqat 

iiiimii iimim 1 1 1 ill ill 

1495 aaccagaacattctttagaaccttcttccqcttq 



xcgcttgccaac 




1548 qqqqtttccttqqqtcttcactatcatctttcgatcttgtattgttttcc 1597 

1W TTTTJ J J 1 1 J JTtTJLI J J 1 1 1 J IJI I J I J JJ.ITI 1 1 J II 1 III III I 

1545 ggggtttccttgggtcttcactatcatctttcgatctcgtcttgtttt( 



:cc 1594 

1598 atgtacatgtttgttttggccccgaacagcatgaggatgtcacttgtgag 1647 

IJillNlllIJllllJllI II IIIITIIITITTIJTjllMITlTlT,^ 

1595 atgtacatgtttgttttggcgccaaacagcatgaggatgtcacttgtgag 1644 

• • • • • 

1648 acatttgctttcagatccttctggtgcagttatggttaaagcttacctcq 1697 

^^.^ 1 1 1 JIJTIJ 1 1 1 ITI J I IJJ I JTTJTI IT1 1 IITTJ I ITIIJI I II IT 

1645 acatttgctttcagatccttctggtgcagttatggtaagagcttacctcg 1694 

• • • * • 

1698 aaaggtaatc. . .tgttttatgaaactatagtgtctcattaaataaatqa 1744 

,™ Mliill II, 1 1 llllJlH) III ITU 1 1 11111.11111 „ AA 

1695 aaaggtagtctcatctattattaaactctagtgtttcaccaaataaatga 1744 



1 745 ggatccttcgt atatgtatatgatcatctctatgtat 

„, c nillllllT II ITIIIIITI]IIIIIIJIITII 

1 745 ggat ccttcgaa tgtgt ata tgatcatctctatgt at 



gatcatctctatgtatatcctatattcta 1794 



cctgtactcta 1794 



1795 a - :c :ca - :aaag :aatcgaaaat - :cattga" :agaaaaaaaaaaaaaaaaaa 1844 
1795 atctcataaagiaaatgccgggtttgataitgttgtgtcaaaccggccaa 1844 

1845 aaaa ! ! ! ! ! 1848 

1845 tgatataaagtaaatttattgatacaaaagtagtttttttttttaaaaaa 1894 



FIG. 23 D 
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GAP program of Genetics Computer Group FIG. 24 A 
blosum62.cmp 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch: -2.003 
Quality: 2728 Length: • 530 

Ratio: 5,147 Gaps: 0 

Percent Similarity: 99,623 Percent Identity: 99.057 
Match display thresholds for the al ignment(s) : 

= IDENTITY : = 2 . = 1 



Adonis palaestina c -cyclase #3 x Adonis polaestina £ -cyclase #5 
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• • • • 

MELLGVRNLISSCPVWTFGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 
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IIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIII 
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50 


51 


• • • • • 
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200 


151 


200 
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CVESGVSYLNSKVERITEAGDGHSLVVCENDI FI PCRLATVASGAASGKL 


250 
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IIIIIIIIMIIIIIlllllllllllllhlllllllllllllllllll 

CVESGVSYLDSKVERITEAGDGHSLVVCENEI F I PCRLATVASGAASGKL 


250 


251 


LEYEVGGPRVCVQTAYGVE VEVENNPYDPN LMV FMDYRDYMQQKLQCSEE 

llllllllllllllllllllllllllllllllllllllllllllllllll 
LEYEVGGPRVCVQTAYGVEVEVENNPYDPNLMV FMDYRDYMQQKLQCSEE 


300 


251 


300 


301 


EYPTFLYVMPMSPTRLFFEETCLASKDAMPFDLLKRKLMSRLKTLGIQVT 
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EYPTFLYVMPMSPTRLFFEETCLASKDAMPFDLLKRKLMSRLKTLGIQVT 


350 


301 


350 
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351 KIYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGY5VVRSLSEAPK 400 

I • 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1| 1 1 1 1 I 1 1 I 1 1 I 1 1 1 1 J I 1 1 1 1 1 II 
351 KVYEEEWSYIPVGGSLPNTEQKNLAF(WV^MVHPAT6YSVVRSLS^PK 400 

401 }^yMF L ^ QDNSAYm ^ 450 
401 YASVIAKILKQDNSAYVVSGQSSAVNISMQAWSSW 450 

451 ^H VQLDI ^ TRTFFRTFFRL ™^ 500 
451 LELIVQLDIEATRTFFRTFFRLPTl^ 50 o 

501 APNSMRMSLVRHLLSDPSGAVMVKAYLER* 530 
501 APNSMf^LVjULL^D^ 530 

FIG. 24B 
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FIG. 28 A 

GAP of: Arabidopsis epsilon cyclase to Lettuce epsilon cyclase 

Gap Weight: 12 Average Match: 2.912 

Length Weight: 4 Average Mismatch:. -2.003 

Quality: 1837 Length: 534 

Ratio: 3.499 Gaps- 3 

Percent Similarity: 76.381 Percent Identity: 69.905 
Match display thresholds for the al ignment(s) • 
| = IDENTITY : = 2 . = 1 

Arabidopsis x Lettuce 

1 MOTGARNF.AAMAVSTFPSW. . .SCRRKFPVVKRYSYRNIRFGLCSVRA 46 

III llll I III I I • • I || ..|. : I: 
1 MECFGARNMTATMAVFTCPRFTDCNIRHKFSLLKQRRFTNLSASSSLRQI 50 

47 SGGGSSGSESCVAVREDFADEEDFVKAGGSEILFVC^XJNI^DEQSKLV 96 
51 KCSAKSDRC VVDKQG I SVADEEDYvIIgGSELF W 100 
97 DKLPPISIGDGALDHVVIGCGPAGU\IJvABAKLGLKVGLIGPDLPFTNN 146 
101 EKU¥)IPIGNCIL^ 150 

147 YGVWEDEFNDLGLQKCIEHVWRETIVYLDDDKPITIGRAYGRVSRRLLHE 196 

J I M ' 1 1 1 Ml: MM |::|:||||| II MIIIMI I Mil 
151 YG\WEFIGLGLEGCIEHSWKDTLVYLlW 200 

197 ELLRRCVESGVS YLSSKVDS ITEASDGLRLVACDDNNAI PCRLATVASGA * 246 

_ MIIIMMIIIIIMI: MM -I |: |: I MINIMUM 

201 ELLRRCVESGVSYLSSKVERITEAPNGYSLI ECEGNITI PCRLATVASGA 250 
247 ^GKLLQYEVGGPRVCVQTAYGVEVEVENSPYDPtX^VTOYRDYTNEKV 296 
251 ASGKFLEYELGGPRVCVQTAYGIEVEV^ 300 

297 RSLEAEYPTFLYAMFWKSRLFFEETCU\SI<XIVMPFDLLI<TKLMLRLDTL 346 
™ N 1 1 - 1 1 1 1 1 1 I |. .::|||lllllh: IIMIMM II : ' 
301 ESLEAKYPTFLYVMAMSPTKIFFEETCUSREAMPmLLKSK^ 350 
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FIG. 28 B 

347 GIRILIOYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 396 

MM MMMMMMMMMMMMMMMMMMMMMMi 

351 GIRITRTYEEEWSYIPVGGSLPNTEQKNLAFGAAASMVHPATGYSVVRSL 400 

397 SEAPKYASVIAEILREETTKQINS. . . . .NISRQAWDTLWPPERKRQRAF 441 

MM 1 1 • ! 1 1 • 1 1 h = - I 1 1 1 : 1 1 1 : 1 1 1 1 MMMM 
401 SEAPNYAAVIAKI LRQDQSKEMISLGKYTNI SKQAWETLWPLERKRQRAF 450 

442 FLFGU\LIVQFDTEGIRSFFRTFFRLPKWMWQGFLGSTLTSGDLVLFALY 491 

1 1 1 1 1 > M I M I • 1 1 1 i 1 1 1 1 1 1 1 1 1 MMI-M W--WW 

451 FLFGLSHIVLXDLEGTRTFFRTFFRLPKWMWWGFLGSSLSSTDLIIFALY 500 

492 MFVISPNNLRKGLINHLISDPTGATMIKTYLKV* 525 

MM-I--M |: I h 1 1 1 1 1 1 1 h I M :| 
501 MFVIAPHSLRMELVRHLLSDPTGATMVKAYLTI* 534 
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SEQUENCE LISTING 

<110> CUNNINGHAM JR., FRANCIS X. 
SUN, ZAIREN 

<120> GENES OF CAROTENOID BIOSYNTHESIS AND METABOLISM AND 
METHODS OF USE THEREOF 

<130> 8172-9023 

<140> NOT YET ASSIGNED 
<141> 1999-06-02 

<150> 09/088,724 
<151> 1998-06-02 

<150> 09/088,725 
<151> 1998-06-02 

<160> 61 

<170> Patentln Ver . 2.0 

<210> 1 
<211> 1860 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 
<221> CDS 

<222> (109) . . (1680) 
<400> 1 

ACAAAAGGAA ATAATTAGAT TCCTCTTTCT GCTTGCTATA CCTTGATAGA ACAATATAAC 60 

AATGGTGTAA GTCTTCTCGC TGTATTCGAA ATTATTTGGA GGAGGAAA ATG GAG TGT 117 

Met Glu Cys 
1 

GTT GGG GCT AGG AAT TTC GCA GCA ATG GCG GTT TCA ACA TTT CCG TCA 165 
Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr Phe Pro Ser 
5 10 15 

TGG AGT TGT CGA AGG AAA TTT CCA GTG GTT AAG AGA TAC AGC TAT AGG 213 
Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr Ser Tyr Ara 
20 25 30 35 

AAT ATT CGT TTC GGT TTG TGT AGT GTC AGA GCT AGC GGC GGC GGA AGT 261 
Asn He Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly Gly Gly Ser 
40 45 50 

TCC GGT AGT GAG AGT TGT GTA GCG GTG AGA GAA GAT TTC GCT GAC GAA 309 
Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe Ala Asp Glu 
55 60 65 

GAA GAT TTT GTG AAA GCT GGT GGT TCT GAG ATT CTA TTT GTT CAA ATG 357 
Glu Asp Phe Val Lys Ala Gly Gly Ser Glu He Leu Phe Val Gin Met 
70 75 80 

CAG CAG AAC AAA GAT ATG GAT GAA CAG TCT AAG CTT GTT GAT AAG TTG 4 05 



\ 
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Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val Asp Lys Leu 
8 5 90 95 

CCT CCT ATA TCA ATT GGT GAT GGT GCT TTG GAT CAT GTG GTT ATT GGT 4 53 

Pro Pro lie Ser lie' Gly Asp Gly Ala Leu Asp His Val Val. He Gly 
100 105 110 115 

TGT GGT CCT GCT GGT TTA GCC TTG GCT GCA GAA TCA GCT AAG CTT GGA 501 
Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly 
120 125 130 

TTA AAA GTT GGA CTC ATT GGT CCA GAT CTT CCT TTT ACT AAC AAT TAC 54 9 

Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr 
135 140 145 

GGT GTT TGG GAA GAT GAA TTC AAT GAT CTT GGG CTG CAA AAA TGT ATT 597 
Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin Lys Cys He 
150 155 160 

GAG CAT GTT TGG AGA GAG ACT ATT GTG TAT CTG GAT GAT GAC AAG CCT 64 5 

Glu His Val Trp Arg Glu Thr He Val Tyr Leu Asp Asp Asp Lys Pro 
165 170 175 

ATT ACC ATT GGC CGT GCT TAT GGA AGA GTT AGT CGA CGT TTG CTC CAT 693 
He Thr He Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His 
180 185 190 195 

GAG GAG CTT TTG AGG AGG TGT GTC GAG TCA GGT GTC TCG TAC CTT AGC 741 
Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser 
200 205 " 210 

TCG AAA GTT GAC AGC ATA ACA GAA GCT TCT GAT GGC CTT AGA CTT GTT 78 9 

Ser Lys .Val Asp Ser He Thr Glu Ala Ser Asp Gly Leu Arg Leu Val 
215 220 " 225 

GCT TGT GAC GAC AAT AAC GTC ATT CCC TGC AGG CTT GCC ACT GTT GCT 8 37 

Ala Cys Asp Asp Asn Asn Val He Pro Cys Arg Leu Ala Thr Val Ala 
230 235 240 

TCT GGA GCA GCT TCG GGA AAG CTC TTG CAA TAC GAA GTT GGT GGA CCT 885 
Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val Gly Gly Pro 
245 250 255 

AGA GTC TGT GTG CAA ACT GCA TAC GGC GTG GAG GTT GAG GTG GAA AAT 933 
Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val Glu Asn 
260 265 270 275 

AGT CCA . TAT GAT CCA GAT CAA ATG GTT TTC ATG GAT TAC AGA GAT TAT 981 
Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr Arg Asp Tyr 
280 285 ' 290 

ACT AAC GAG AAA GTT CGG AGC TTA GAA GCT GAG TAT CCA ACG TTT CTG 102 9 

Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu 
295 300 305 

TAC GCC ATG CCT ATG ACA AAG TCA AGA CTC TTC TTC GAG GAG ACA TGT 1077 
Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys 
310 315 320 

TTG GCC TCA AAA GAT GTC ATG CCC TTT GAT TTG CTA AAA ACG AAG CTC 112 5 

Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu 
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325 330 335 

ATG TTA AGA TTA GAT ACA CTC GGA ATT CGA, ATT CTA AAG ACT TAC GAA 1173 
Met Leu Arg Leu Asp Thr Leu Gly He Arg'lle Leu Lys Thr Tyr Glu 

345 350 - 355 

GAG GAG TGG TCC TAT ATC CCA GTT GGT GGT TCC TTG CCA AAC ACC GAA 1221 
Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn Thr Glu 
360 365 370 

CAA AAG AAT CTC GCC TTT GGT GCT GCC GCT AGC ATG GTA CAT CCC GCA 12 69 

Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala 
375 380 385 

ACA GGC TAT TCA GTT GTG AGA TCT TTG TCT GAA GCT CCA AAA TAT GCA 1317 
Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala 
390 395 400 

TCA GTC ATC GCA GAG ATA CTA AGA GAA GAG ACT ACC AAA CAG ATC AAC 1365 
Ser Val He Ala Glu He Leu Arg Glu Glu Thr Thr Lys Gin He Asn 
405 410 415 

AGT AAT ATT TCA AGA CAA GCT TGG GAT ACT TTA TGG CCA CCA GAA AGG 1413 
Ser Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro Pro Glu Ara 
420 425 430 435 

AAA AGA CAG AGA GCA TTC TTT CTC TTT GGT CTT GCA CTC ATA GTT CAA 14 61 

Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu He Val Gin 
440 445 450 

TTC GAT ACC GAA GGC ATT AGA AGC TTC TTC CGT ACT TTC TTC CGC CTT 1509 
Phe Asp Thr Glu Gly He Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu 
455 460 465 

CCA AAA TGG ATG TGG CAA GGG TTT CTA GGA TCA ACA TTA ACA TCA GGA 1557 
Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu Thr Ser Glv 
470 475 480 

GAT CTC GTT CTC TTT GCT TTA TAC ATG TTC GTC ATT TCA CCA AAC AAT 1605 
Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val He Ser Pro Asn Asn 
485 490 495 

TTG AGA AAA GGT CTC ATC AAT CAT CTC ATC TCT GAT CCA ACC GGA GCA 1653 
Leu Arg Lys Gly Leu He Asn His Leu He Ser Asp Pro Thr Gly Ala 
500 505 510 515 

ACC ATG ATA AAA ACC TAT CTC AAA GTA TGATTTACTT ATCAACTCTT 1700 
Thr Met He Lys Thr Tyr Leu Lys Val 
520 

AGGTTTGTGT ATATATATGT TGATTTATCT GAATAATCGA TCAAAGAATG GTATGTGGGT 17 60 

TACTAGGAAG TTGGAAACAA ACATGTATAG AATCTAAGGA GTGATCGAAA TGGAGATGGA 1820 

AACGAAAAGA AAAAAATCAG TCTTTGTTTT GTGGTTAGTG I860 



<210> 2 
<211> 524 
<212> PRT 

<213> Arabidopsis thaliana 



3 



BNSDOC1D: <WO 9963055A1_IA> 
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<4 00> 2 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 
1 5 10 15 

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 

Ser Tyr Arg Asn lie Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly 
35 40 45 " 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu lie Leu Phe 
65 70 75 80 

Val Gin Met Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val 
85 90 J 95 

Asp Lys Leu Pro Pro lie Ser lie Gly Asp Gly Ala Leu Asp His Val 
100 105 110 

Val lie Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 120 125 

Lys Leu Gly Leu Lys Val Gly Leu lie Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin 
145 150 155 * " 160 

Lys Cys lie Glu His Val Trp Arg Glu Thr lie Val Tyr Leu Asp Asp 
165 170 175 

Asp Lys Pro lie Thr lie Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 205 

Tyr Leu Ser Ser Lys Val Asp Ser lie Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val lie Pro Cys Arg Leu Ala 
225 230 235 240 

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu 
260 265 ^ 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
305 310 315 320 
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Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
325 330 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly He Arg He Leu Lys 
340 345 350 

Thr Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro 
355 360 365 

Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lys Tyr Ala Ser Val He Ala Glu He Leu Arg Glu Glu Thr Thr Lys 
405 410 415 

Gin lie Asn Ser Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro 
4 20 4 25 4 30 

Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe ' Leu Phe Gly Leu Ala Leu 
435 440 445 

He Val Gin Phe Asp Thr Glu Gly lie Arg Ser Phe Phe Arg Thr Phe 
4 50 4 55 4 60 

Phe Arg Leu Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val lie Ser 
485 490 495 

Pro Asn Asn Leu Arg Lys Gly Leu lie Asn His Leu lie Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met He Lys Thr Tyr Leu Lys Val 
515 520 

<210> 3 
<211> 956 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 3 



GCTCTTTCTC 


CTCCTCCTCT 


ACCGATTTCC 


GACTCCGCCT 


CCCGAAATCC 


TTATCCGGAT 


60 


TCTCTCCGTC 


TCTTCGATTT 


AAACGCTTTT 


CTGTCTGTTA 


CGTCGTCGAA 


GAACGGAGAC 


120 


AGAATTCTCC 


GATTGAGAAC 


GATGAGAGAC 


CGGAGAGCAC 


GAGCTCCACA 


AACGCTATAG 


180 


ACGCTGAGTA 


TCTGGCGTTG 


CGTTTGGCGG 


AGAAATTGGA 


GAGGAAGAAA 


TCGGAGAGGT 


240 


CCACTTATCT 


AATCGCTGCT 


ATGTTGTCGA 


GCTTTGGTAT 


CACTTCTATG 


GCTGTTATGG 


300 


CTGTTTACTA 


CAGATTCTCT 


TGGCAAATGG 


AGGGAGGTGA 


GATCTCAATG 


T TGG AAATG T 


360 


TTGGTACATT 


TGCTCTCTCT 


GTTGGTGCTG 


CTGTTGGTAT 


GGAATTCTGG 


GCAAGATGGG 


420 


CTCATAGAGC 


TCTGTGGCAC 


GCTTCTCTAT 


GGAATATGCA 


TGAGTCACAT 


CACAAACCAA 


480 
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GAGAAGGACC 


GTTTGAGCTA 


AACGATGTTT 


TTGCTATAGT 


GAACGCTGGT 


CCAGCGATTG 


540 


GTCTCCTCTC 


TTATGGATTC 


TTCA^TAAAG 


GACTCGTTCC 


TGGTCTCTGC 


TTTGGCGCCG 


600 


GGTTAGGCAT 


AACGGTGTTT 


GGAATCGCCT 


ACATGTTTGT 


CCACGATGGT 


CTCGTGCACA 


660 


AGCGTTTCCC 


TGTAGGTCCC 


ATCGCCGACG 


TCCCTTACCT 


CCGAAAGGTC 


GCCGCCGCTC 


720 


ACCAGCTACA 


TCACACAGAC 


AAGTTCAATG 


GTGTACCATA 


TGGACTGTTT 




ion 

/ o U 


AGGAATTGGA 


AGAAGTTGGA 


GGAAATGAAG 


AGTTAGATAA 


GGAGATTAGT 


CGGAGAATCA 


840 . 


AATCATACAA 


AAAGGCCTCG 


GGCTCCGGGT 


CGAGTTCGAG 


TTCTTGACTT 


TAAACAAGTT 


900 


TTAAATCCCA 


AATTCTTTTT 


TTGTCTTCTG 


TCATTATGAT 


CATCTTAAGA 


CGGTCT 


956 



<210> 4 
<211> 294 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 4 

Ser Phe Ser Ser Ser Ser Thr Asp Phe Arg Leu Arg Leu Pro Lys Ser 
1 5 10 15 

Leu Ser Gly Phe Ser Pro Ser Leu Arg Phe Lys Arg Phe Ser Val Cys 
20 25 30 

Tyr Val Val Glu Glu Arg Arg Gin Asn Ser Pro He Glu Asn Asp Glu 
35 4 0 4 5 

Arg Pro Glu Ser Thr Ser Ser Thr Asn Ala He Asp Ala Glu Tyr Leu 
50 55 60 

Ala Leu Arg Leu Ala Glu Lys Leu Glu Arg Lys Lys Ser Glu Arg Ser 
65 70 75 80 

Thr Tyr Leu He Ala Ala Met Leu Ser Ser Phe Gly He Thr Ser Met- 
85 90 95 

Ala Val Met Ala Val Tyr Tyr Arg Phe Ser Trp Gin Met Glu Gly Gly 
100 105 110 

Glu He Ser Met Leu Glu Met Phe Gly Thr Phe Ala Leu Ser Val Gly 
115 120 125 

Ala Ala Val Gly Met Glu Phe Trp Ala Arg Trp Ala His Arg Ala Leu 
130 135 140 

Trp His Ala Ser Leu Trp Met Asn His Glu Ser His His Lys Pro Arg 
14 5 150 155 160 

Glu Gly Pro Phe Glu Leu Asn Asp Val Phe Ala He Val Asn Ala Gly 
165 170 175 

Pro Ala He Gly Leu Leu Ser Tyr Gly Phe Phe Asn Lys Gly Leu Val 
180 185 190 

Pro Gly Leu Cys Phe Gly Ala Gly Leu Gly He Thr Val Phe Gly He 
195 200 205 
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Ala Tyr Met Phe Val His Asp Gly Leu Val His Lys Arg Phe Prc/val 
210 215 220 

. Gly Pro He Ala Asp Val Pro Tyr Leu Arg Lys Val Ala Ala Ala His 
225 230 235 240 

Gin Leu His His Thr Asp Lys Phe Asn Gly Val Pro Tyr Gly Leu Phe 
245 250 255 

Leu Gly Pro Lys Glu Leu Glu Glu Val Gly Gly Asn Glu Glu Leu Asd 
260 265 270 

Lys Glu He Ser Arg Arg He Lys Ser Tyr Lys Lys Ala Ser Gly Ser 
275 280 285 

Gly Ser Ser Ser Ser Ser 
290 

<210> 5 
<211> 162 
<212> PRT 

<213> Alicalgenes sp. 
<400> 5 

Met Thr Gin Phe Leu He Val Val Ala Thr Val Leu Val Met Glu Leu 
1 5 io 15 

Thr Ala Tyr Ser Val His Arg Trp He Met His Gly Pro Leu Gly Trp 
20 25 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lvs 
35 40 45 y 

Asn Asp Leu Tyr Gly Val Val Phe Ala Val Leu Ala Thr He Leu Phe 
50 55 60 

Thr Val Gly Ala Tyr Trp Trp Pro Val Leu Trp Trp He Ala Leu Gly 
65 70 75 " eo 

Met Thr Val Tyr Gly Leu He Tyr Phe He Leu His Asp Gly Leu Val 

85 90 95 

His Gin Arg Trp Pro Phe Arg Tyr He Pro Arg Arg Gly Tyr Phe Ara 
100 105 no 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arq Asn 
115 120 125 

His Cys Val Ser Phe Gly Phe He Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gin Asp Leu Lys Arg Ser Gly Val Leu Arg Pro Gin Asp Glu Arq 
145 150 155 i 6 o 

Pro Ser 



<210> 6 
<211> 175 
<212> PRT 

<213> Erwinia herbicola 



7 
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<400> 6 

Met Leu Asn Ser Leu lie Val lie Leu Ser Val lie Ala Met Glu Gly 
1 5 10 15 

He Ala Ala Phe Thr His Arg Tyr He Met His Gly Trp Gly Trp Arg 

20 .25 30 

Trp His Glu Ser His His Thr Pro Arg Lys Gly Val Phe Glu Leu Asn 
35 40 45 

Asp Leu Phe Ala Val Val Phe Ala Gly Val Ala He Ala Leu He Ala 
50 55 60 

Val Gly Thr Ala Gly Val Trp Pro Leu Gin Trp He Gly Cys Gly Met 
65 70 75 80 

Thr Val Tyr Gly Leu Leu Tyr Phe Leu Val His Asp Gly Leu Val His 
85 90 95 

Gin Arg Trp Pro Phe His Trp lie Pro Arg Arg Gly Tyr Leu Lys Arg 
100 105 110 

Leu Tyr Val Ala His Arg Leu His His Ala Val Arg Gly Arg Glu Gly 
115 120 125 

Cys Val Ser Phe Gly Phe He Tyr Ala Arg Lys Pro Ala Asp Leu Gin 
130 135 140 

Ala He Leu Arg Glu Arg His Gly Arg Pro Pro Lys Arg Asp Ala Ala 
145 150 " 155 160 

Lys Asp Arg Pro Asp Ala Ala Ser Pro Ser Ser Ser Ser Pro Glu 
165 170 175 

<210> 7 
<211> 175 
<212> PRT 

<213> Erwinia uredovora 
<400> 7 

Met Leu Trp He Trp Asn Ala Leu He Val Phe Val Thr Val He Gly 
15 10 15 

Met Glu Val He Ala Ala Leu Ala His Lys Tyr He Met His Gly Trp 
20 25 30 

Gly Trp Gly Trp His Leu Ser His His Glu Pro Arg Lys Gly Ala Phe 
35 40 45 

Glu Val Asn Asp Leu Tyr Ala Val Val Phe Ala Ala Leu Ser He Leu 
50 55 60 

Leu He Tyr Leu Gly Ser Thr Gly Met Trp Pro Leu Gin Trp He Gly 
65 70 75 80 

Ala Gly Met Thr Ala Tyr Gly Leu Leu Tyr Phe Met Val His Asp Gly 
85 90 95 

Leu Val His Gin Arg Trp Pro Phe Arg Tyr lie Pro Arg Lys Gly Tyr 
100 105 110 
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Leu Lys Arg Leu Tyr Met Ala His Arg Met His His Ala Val Arg "Gly 
115 120 125 

Lys Glu Gly Cys Val.Ser Phe Gly Phe Leu Tyr Ala Pro Pro Leu Ser 
130 135 140 

Lys Leu Gin Ala Thr Leu Arg Glu Arg His Gly Ala Arg Ala Gly Ala 
145 150 155 160 

Ala Arg Asp Ala Gin Gly Gly Glu Asp Glu Pro Ala Ser Gly Lys 
165 170 175 

<210> 8 
<211> 162 
<212> PRT 

<213> Agrobacterium aurianticum 
<400> 8 

Met Thr Asn Phe Leu lie Val Val Ala Thr Val Leu Val Met Glu Leu 
15 10 15 

Thr Ala Tyr Ser Val His Arg Trp lie Met His Gly Pro Leu Gly Trp 
20 25 ' 30 

Gly Trp His Lys Ser His His Glu Glu His Asp His Ala Leu Glu Lys 
35 40 45 

Asn Asp Leu Tyr Gly Leu Val Phe Ala Val lie Ala Thr Val Leu Phe 
50 55 60 

Thr Val Gly Trp He Trp Ala Pro Val Leu Trp Trp He Ala Leu Gly 
" 70 75 80 

Met Thr Val Tyr Gly Leu He Tyr Phe Val Leu His Asp Gly Leu Val 
85 90 * 95 

His Trp Arg Trp Pro Phe Arg Tyr He Pro Arg Lys Gly Tyr Ala Arg 
100 105 " HO 

Arg Leu Tyr Gin Ala His Arg Leu His His Ala Val Glu Gly Arg Asp 
115 120 125 

His Cys Val Ser Phe Gly Phe He Tyr Ala Pro Pro Val Asp Lys Leu 
130 135 140 

Lys Gin Asp Leu Lys Met Ser Gly Val Leu Arg Ala Glu Ala Gin Glu 
145 150 155 160 

Arg Thr 

<210> 9 
<211> 954 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 9 

CCACGGGTCC GCCTCCCCGT TTTTTTCCGA TCCGATCTCC GGTGCOGAGG ACTCAGCTGT 60 
TTGTTCGCGC TTTCTCAGCC GTCACCATGA CCGATTCTAA CGATGCTGGA ATGGATGCTG 120 
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TTCAGAGACG 


ACTCATGTTT 


GAAGACGAAT 


GCATTCTGGT 


TGATGAAAAT 


AATCGTGTGG 


180 


TGGGACATGA 


CACTAAGTAT 


AACTGTCATC 


TGATGGAAAA 


GATTGAAGCT 


GAGAATTTAC 


240 


TTCACAGAGC 


TTTCAGTGTG 


TTTTTATTCA 


ACTCCAAGTA 


TGAGTTGCTT 


CTCCAGCAAC 


300 


GGTCAAAAAC 


AAAGGTTACT 


TTCCCACTTG 


TGTGGACAAA 


CACTTGTTGC 


AGCCATCCTC 


360 


TTTACCGTGA 


ATCCGAGCTT 


ATTGAAGAGA 


ATGTGCTTGG 


TGTAAGAAAT 


GCCGCACAAA 


420 


GGAAGCTTTT 


CGATGAGCTC 


GGTATTGTAG 


CAGAAGATGT 


ACCAGTCGAT 


GAGTTCACTC 


480 


CCTTGGGACG 


CATGCTTTAC 


AAGGCACCTT 


CTGATGGGAA 


ATGGGGAGAG 


CACGAAGTTG 


540 


ACTATCTACT 


CTTCATCGTG 


CGGGATGTGA 


AGCTTCAACC 


AAACCCAGAT 


GAAGTGGCTG 


600 


AGATCAAGTA 


CGTGAGCAGG 


GAAGAGCTTA 


AGGAGCTGGT 


GAAGAAAGCA 


GATGCTGGCG 


660 


ATGAAGCTGT 


GAAACTATCT 


CCATGGTTCA 


GATTGGTGGT 


GGATAATTTC 


TTGATGAAGT 


720 


GGTGGGATCA 


TGTTGAGAAA 


GGAACTATCA 


CTGAAGCTGC 


AGACATGAAA 


ACCATTCACA 


780 


AGCTCTGAAC 


TTTCCATAAG 


TTTTGGATCT 


TCCCCTTCCC 


ATAATAAAAT 


TAAGAGATGA 


840 


GACTTTTATT 


GATTACAGAC 


AAAACTGGCA 


ACAAAATCTA 


TTCCTAGGAT 


TTTTTTTTGC 


900 


TTTTTATTTA 


CTTTTGATTC 


ATCTCTAGTT 


TAGTTTTCAT 


CTTAAAAAAA 


AAAA 


954 



<210> 10 
<211> 996 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 10 

CACCAATGTC TGTTTCTTCT TTATTTAATC TCCCATTGAT TCGCCTCAGA TCTCTCGCTC 60 

TTTCGTCTTC TTTTTCTTCT TTCCGATTTG CCCATCGTCC TCTGTCATCG ATTTCACCGA 120 

GAAAGTTACC GAATTTTCGT GCTTTCTCTG GTACCGCTAT G AC AG AT AC T AAAGATGCTG 180 

GTATGGATGC TGTTCAGAGA CGTCTCATGT TTGAGGATGA ATGCATTCTT GTTGATGAAA 24 0 

CTGATCGTGT TGTGGGGCAT GTCAGCAAGT ATAATTGTCA TCTGATGGAA AATATTGAAG 300 

CCAAGAATTT GCTGCACAGG GCTTTTAGTG TATTTTTATT CAACTCGAAG TATGAGTTGC 360 

TTCTCCAGCA AAGGTCAAAC ACAAAGGTTA CGTTCCCTCT AGTGTGGACT AACACTTGTT 4 20 

GCAGCCATCC TCTTTACCGT GAATCAGAGC TTATCCAGGA CAATGCACTA GGTGTGAGGA 4 80 

ATGCTGCACA AAGAAAGCTT CTCGATGAGC TTGGTATTGT AGCTGAAGAT GTACCAGTCG 54 0 

ATGAGTTCAC TCCCTTGGGA CGTATGCTGT ACAAGGCTCC TTCTGATGGC AAATGGGGAG 600 

AGCATGAACT TGATTACTTG CTCTTCATCG TGCGAGACGT GAAGGTTCAA CCAAACCCAG 660 

ATGAAGTAGC TGAGATCAAG TATGTGAGCC GGGAAGAGCT GAAGGAGCTG GTGAAGAAAG 720 

CAGATGCAGG TGAGGAAGGT TTGAAACTGT CACCATGGTT CAGATTGGTG GTGGACAATT 780 



10 
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TCTTGATGAA GTGGTGGGAT CATGTTGAGA AAGGAACTTT 
AAACCATCCA CAAACTCTGA ACATCTTTTT TTAAAGTTTT 
TCATCATTTT TATCTTTTCG ATGATAATAA TTTGGGATAT 
CCAAGCACCT CAGGCAATAA TAAAGTTTGC GGCCGC 

<210> 11 
<211> 1165 
<212> DNA 

<213> Haematococcus pluvialis 
<400> 11 



PCT/US99/12121 

GGTTGAAGCT ATAGACATGA 84 0 

TAAATCAATC AACTTTCTCT 900 
GTGAGACACT TACAAAACTT 960 

996 



CTCGGTAGCT 


GGCCACAATC 


GCTATTTGGA 


ACCTGGCCCG 


GCGGCAGTCC 


GATGCCGCGA 


60 


TGCTTCGTTC 


GTTGCTCAGA 


GGCCTCACGC 


ATATCCCCCG 


CGTGAACTCC 


GCCCAGCAGC 


120 


CCAGCTGTGC 


ACACGCGCGA 


CTCCAGTTTA 


AGCTCAGGAG 


CATGCAGATG 


ACGCTCATGC 


180 


AGCCCAGCAT 


CTCAGCCAAT 


CTGTCGCGCG 


CCGAGGACCG 


CACAGACCAC 


ATGAGGGGTG 


240 


CAAGCACCTG 


GGCAGGCGGG 


CAGTCGCAGG 


ATGAGCTGAT 


GCTGAAGGAC 


GAGTGCATCT 


300 


TGGTGGATGT 


TGAGGACAAC 


ATCACAGGCC 


ATGCCAGCAA 


GCTGGAGTGT 


CACAAGTTCC 


360 


TACCACATCA 


GCCTGCAGGC 


CTGCTGCACC 


GGGCCTTCTC 


TGTGTTCCTG 


TTTGACGATC 


420 


AGGGGCGACT 


GCTGCTGCAA 


CAGCGTGCAC 


GCTCAAAAAT 


CACCTTCCCA 


AGTGTGTGGA 


480 


CGAACACCTG 


CTGCAGCCAC 


CCTTTACATG 


GGCAGACCCC 


AGATGAGGTG 


GACCAACTAA 


540 


GCCAGGTGGC 


CGACGGAACA 


GTACCTGGCG 


CAAAGGCTGC 


TGCCATCCGC 


AAGTTGGAGC 


600 


ACGAGCTGGG 


GATACCAGCG 


CACCAGCTGC 


CGGCAAGCGC 


GTTTCGCTTC 


CTCACGCGTT 


660 


TGCACTACTG 


TGCCGCGGAC 


GTGCAGCCAG 


CTGCGACACA 


ATCAGCGCTC 


TGGGGCGAGC 


720 


ACGAAATGGA 


CTACATCTTG 


TTCATCCGGG 


CCAACGTCAC 


CTTGGCGCCC 


AACCCTGACG 


780 


AGGTGGACGA 


AGTCAGGTAC 


GTGACGCAAG 


AGGAGCTGCG 


GCAGATGATG 


CAGCCGGACA 


840 


ACGGGCTGCA 


ATGGTCGCCG 


TGGTTTCGCA 


TCATCGCCGC 


GCGCTTCCTT 


GAGCGTTGGT 


900 


GGGCTGACCT 


GGACGCGGCC 


CTAAACACTG 


ACAAACACGA 


GGATTGGGGA 


ACGGTGCATC 


960 


ACATCAACGA 


AGCGTGAAAG 


CAGAAGCTGC 


AGGATGTGAA 


GACACGTCAT 


GGGGTGGAAT 


1020 


TGCGTACTTG 


GCAGCTTCGT 


ATCTCCTTTT 


TCTGAGACTG 


AACCTGCAGT 


CAGGTCCCAC 


1080 


AAGGTCAGGT 


AAAATGGCTC 


GATAAAATGT 


ACCGTCACTT 


TTTGTCGCGT 


ATACTGAACT 


1140 


CCAAGAGGTC 


AAAAAAAAAA 


AAAAA 








1165 



<210> 12 
<211> 1135 
<212> DNA 

<213> Haematococcus pluvialis 



M 
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<400> 12 


(~ t\ n atp 

buLLALAM i b 


bb 1 Pi.1 I I bbrt 


ALL 1 bbbbLG 


GCGGCAGTCC 


GATGCCGCGA 


60 


1 V3\_ 1 1 V^Vj 1 1 L 


oil bL 1 LHbM 


obit 1 bAbbb 


A I A 1 bbbbCb 


CGTGAACTCC 


GCCCAGCAGC 


120 




APAr'pprzr'PA 


PTrrnrTTTa 

L 1 LLAb I 1 Irt. 


Abb I LAbbAb 


CATGCAGCTG 


CTTTCCGAGG 


180 


APPPPAPAfA 


V-L^/AV^M 1 bAbb 


bb J bLAAbLA 


bb I bbbbAbG 


CGGGCAGTCG 


CAGGATGAGC 


240 




PP APP APTPP 


A1L1 IbolAL 


Aibl 1 bAbb A 


TV TV TV rT> TV ^» T\ 

CAACATCACA 


GGCCATGCCA 


300 


PPAAPPTPP A 


PTPTPAf^AAP 


TTPPTSPPflP 


A 1 bAbbb 1 bb 


AGGCCTGCTG 


CACCGGGCCT 


360 


TPTPTPTPTT 


PPTPTTTP AP 


P ATP APPPPP 


PflPTPPTPPT 
bflb I bl J. bb 1 


bbAAbAGCGT 


GCACGCTCAA 


420 


AAATPACPTT 


PPPAAPTPTP 


TPPAPPAAPA 


pptpptppqp 

v-V-' I V3^- 1 bbnb 


bbAbbb 1 1 TA 


CATGGGCAGA 


4 80 


CCCCAGATGA 


GGTPGAPPAA 


PTAAPPPAPP 


TPPPPP APPP 
l bbbbbnbbb 


anrarT7\ rr^nr* 
AAbAb 1 Abb 1 


GGCGCAAAGG 


540 


CTGCTGCCAT 


CCGCAAGTTG 


GAPPAPPAPP 


TPPPPATAPP 


Abb bbAb bAb 


b I bbCGGCAA 


600 


GCGCGTTTCG 


PTTPPTPAPP 


PPTTTPPAPT 


r^v_ 1 b 1 bbbbb 


bbAbb 1 bbAb 


CCAGCTGCGA 


660 


PAPAATPAPP 


PPTPTPPPPP 


PAPPAPPAAA 


TPpapTapaT 

1 bbAb 1 AbA 1 


b 1 I b 1 1 bATC 


CGGGCCAACG 


720 


TPAPPTTPPP 


PPPP A APPPT 


PAP^r*Ar*r*Tr*r* 

oALbaAbb 1 bb 


AbbAAb 1 bAb 


bTACGTGACG 


CAAGAGGAGC 


780 


TPPPPP A P* A T 


bA 1 bLAbLLb 


bAbAAbbbbL 


1 1 bAA I bbTb 


GCCGTGGTTT 


CGCATCATCG 


840 




bb 1 1 bAbbb I 


1 bb 1 bbbl I b 


Abb I bbACGC 


GGCCCTAAAC 


ACTGACAAAC 


900 


ACGAGGATTG 


GGGAACGGTG 


CATCACATCA 


ACGAAGCGTG 


AAGGCAGAAG 


CTGCAGGATG 


960 


TGAAGACACG 


TCATGGGGTG 


GAATTGCGTA 


CTTGGCAGCT 


TCGTATCTCC 


TTTTTCTGAG 


1020 


ACTGAACCTG 


CAGAGCTAGA 


GTCAATGGTG 


CATCATATTC 


ATCGTCTCTC 


TTTTGTTTTA 


1080 


GACTAATCTG 


TAGCTAGAGT 


bAb i b A 1 bAA 


1 bb 1 1 1ALAA 


CTTTCAAAAA 


AAAAA 


1135 


<210> 13 

<211> 960 c 
<212> DNA 

<213> Tagetes erecta 












<400> 13 
CCAAAAACAA 


CTCAAATCTC 


PTPPPTPPPT 
v— x bbb 1 LLdL 1 


PTTDPTPPPP 
bl lAblbbbb 


bATGGGTGAC 


GACTCCGGCA 


60 


TGGATGCTGT 


TCAGCGACGT 


\— x 1 \j X 1 x O 


APP QTP A 2\TP 
nbbn 1 bnn 1 b 


7\ rn rp rp rp y-t rp /— • 

LAI 1 1 1GGTG 


GATGAGTGTG 


120 


ACAATGTGGT 


GGGACATGAT 


APPAAATAPA 


ATTPTPAPTT 

* \ X X \J X Vyrtb 1 1 


7\ Trp ar*7\ 7\p 
bft 1 bbAbAAb 


T\ rp rp f~> T\ 7» a TV 

ATT G AAAC AG 


TOO 

180 


GTAAAATGCT 


GCACAGAGCA 


TTP APPPT TT 
a J- v^nuv*b XXX 


TTPTATTPAA 


TTPAAAATAr 1 
1 1 bAAAA I Ab 


PTVPTTTV r~» rprp r~* 

bAb 1 1 Ab TTb 


24 0 


TTCAGCAACG 


GTCTGCAACC 


AAGGTGACAT 


TTCCTTTAGT 


ATGGACCAAC 


ACCTGTTGCA 


300 


GCCATCCACT 


C T AC AG AG AA 


TCCGAGCTTG 


TTCCCGAAAC 


GCCTGAGAGA 


ATGCTGCACA 


360 


GAGGANNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


420 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


480 



»Z- 
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NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


540 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


600 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


NNNNNNNNNN 


660 


NNNNNNNNNN 


NNNNNNNNNN 


TCATGTGCAA 


AAGGGTACAC 


TCACTGAATG 


CAATTTGATA 


720 


TGAAAACCAT 


ACACAAGCTG 


ATATAGAAAC 


ACACCCTCAA 


OCGAAAAGCA 


AGCCTAATAA 


780 


TTCGGGTTGG 


GTCGGGTCTA 


CCATCAATTG 


TTTTTTTCTT 


TTAACAACTT 


TTAATCTCTA 


840 


TTTGAGCATG 


TTGATTCTTG 


TCTTTTGTGT 


GTAAGATTTT 


GGGTTTCGTT 


TCAGTTGTAA 


900 


TAATGAACCA 


TTGATGGTTT 


GCAATTTCAA 


GTTCCTATCG 


ACATGTAGTG 


ATCTAAAAAA 


960 



<210> 14 

<211> 305 

<212> PRT 

<213> Haematococcus pluvialis 

<400> 14 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His lie Pro Arg Val Asn 
1 5 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30 

Arg Ser Met Gin Met Thr Leu Met Gin Pro Ser He Ser Ala Asn Leu 
35 40 45 

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 
50 55 60 

Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys Asp Glu Cys He 
65 70 75 A ^ 80 

Leu Val Asp Val Glu Asp Asn He Thr Gly His Ala Ser Lys Leu Glu 
85 90 ~ 95 

Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu Leu His Arg Ala 
100 105 ~ 110 

Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu Leu Leu Gin Gin 
115 120 " 125 

Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp Thr Asn Thr Cys 
130 135 140 

Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu Val Asp Gin Leu 
145 150 155 ' 160 

Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala He 
165 170 175 

Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His Gin Leu Pro Ala 
180 185 190 

Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val 
195 200 205 
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Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu His Glu Met Asp 
210 215 220 

Tyr- He Leu Phe He Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp 
225 • 230 235 2 40 

Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu Leu Arg Gin Met 
245 250 255 

Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp Phe Arg He He 
260 265 ' 270 

Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu ■ 
275 280 * 285 

Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His His He Asn Glu 
290 295 300 

Ala 

305 

<210> 15 
<211> 293 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 15 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His He Pro Arg Val Asn 
15 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 ' 30 

Arg Ser Met Gin Leu Leu Ser Glu Asp Arg Thr Asp His Met Arq Glv 
35 40 45 

Ala Ser Thr Trp Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys 
50 55 60 

Asp Glu Cys He Leu Val Asp Val Glu Asp Asn He Thr Gly His Ala 
65 70 75 80 

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu 
85 90 95 

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu 
100 105 no 

Leu Leu Gin Gin Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp 
115 120 125 

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu 
130 135 140 

Val Asp Gin Leu Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys 
145 150 155 160 

Ala Ala Ala He Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His 
165 170 175 

Gin Leu Pro Ala Ser Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys 
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180 185 190 

Ala Ala Asp Val Gin Pro Ala Ala Thr Gin ,Ser Ala Leu Trp Gly Glu 
195 200 205 

His Glu Met Asp Tyr He Leu Phe He Arg Ala Asn Val Thr Leu Ala 
210 215 220 

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr <Sln Glu Glu 
225 230 235 240 

Leu Arg Gin Met Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp 
245 250 255 

Phe Arg He He Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu 
260 265 270 

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His 
275 280 285 

His He Asn Glu Ala 
290 

<210> 16 
<211> 284 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 16 

Met Ser Val Ser Ser Leu Phe Asn Leu Pro Leu He Arg Leu Arg Ser 
1 5 10 15 

Leu Ala Leu Ser Ser Ser Phe Ser Ser Phe Arg Phe Ala His Arg Pro 
20 25 30 

Leu Ser Ser He Ser Pro Arg Lys Leu Pro Asn Phe Arg Ala Phe Ser 
35 40 45 

Gly Thr Ala Met Thr Asp Thr Lys Asp Ala Gly Met Asp Ala Val Gin 
50 55 60 

Arg Arg Leu Met Phe Glu Asp Glu Cys He Leu Val Asp Glu Thr Asp 
65 70 75 * 80 

Arg Val Val Gly His Val Ser Lys Tyr Asn Cys His Leu Met Glu Asn 
85 90 95 

He Glu Ala Lys Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe 
100 105 110 

Asn Ser Lys Tyr Glu Leu Leu Leu Gin Gin Arg Ser Asn Thr Lys Val 
115 120 125 

Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr 
130 135 140 

Arg Glu Ser Glu Leu He Gin Asp Asn Ala Leu Gly Val Arg Asn Ala 
145 150 155 160 

Ala Gin Arg Lys Leu Leu Asp Glu Leu Gly He Val Ala t31u Asp Val 
165 170 175 
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Pro Val Asp Glu "Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro 
180 . 185 190 

Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe He 
195 200 205 

Val Arg Asp Val Lys Val Gin Pro Asn Pro Asp Glu Val Ala Glu He 
210 215 220 

Lys Tyr Val Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp 
225 230 235 ^ ' 240 

Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val ' 
245 250 255 

Asp Asn Phe Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr Leu 
260 265 270 

Val Glu Ala' He Asp Met Lys Thr He . His Lys Leu 
275 280 

<210> 17 
<211> 287 
<212> PRT 

<213> Clarkia breweri 
<400> 17 

Met Ser Ser Ser Met Leu Asn Phe Thr Ala Ser Arg He Val Ser Leu 
1 5 10 15 

Pro Leu Leu Ser Ser Pro Pro Ser Arg Val His Leu Pro Leu Cys Phe 
20 25 30 

Phe Ser Pro He Ser Leu Thr Gin Arg Phe Ser Ala Lys Leu Thr Phe 
35 40 45 

Ser Ser Gin Ala Thr Thr Met Gly Glu Val Val Asp Ala Gly Met Asp 
50 55 60 

Ala Val Gin Arg Arg Leu Met Phe Glu Asp Glu Cys He Leu Val Asp 
65 70 75 80 

Glu Asn Asp Lys Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu 
85 90 95 

Met Glu Lys lie Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val 
100 105 " HO 

Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gin Gin Arg Ser Ala 
115 120 125 

Thr Lys Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His 
130 135 140 

Pro Leu Tyr Arg Glu Ser Glu Leu lie Asp Glu Asn Cys Leu Gly Val 
145 150 155 160 

Arg Asn Ala Ala Gin Arg Lys Leu Leu Asp Glu Leu Gly lie Pro Ala 
165 170 ' 175 

Glu Asp Leu Pro Val Asp Gin Phe lie Pro Leu Ser Arg lie Leu Tyr 



lit 
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180 * 185 190 

Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu 
195 200 205 

Leu Phe He He Arg Asp Val Asn Leu Asp Pro Asn Pro Asp Glu Val 
210 215 220 

Ala Glu Val Lys Tyr Met Asn Arg Asp Asp Leu Lys Glu Leu Leu Arq 
225 230 235 240 

Lys Ala Asp Ala Glu Glu Glu Gly Val Lys Leu Ser Pro Trp Phe Arg 
245 250 255 

Leu Val Val Asp Asn Phe Leu Phe Lys Trp Trp Asp His Val Glu Lys 
260 265 270 

Gly Ser Leu Lys Asp Ala Ala Asp Met ■ Lys Thr He His Lys Leu 
275 280 " 285 

<210> 18 
<211> 261 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 18 

Thr Gly Pro Pro Pro Arg Phe Phe Pro He Arg Ser Pro Val Pro Arq 
1 5 10 15 

Thr Gin Leu Phe Val Arg Ala Phe Ser Ala Val Thr Met Thr Asp Ser 
20 25 30 

Asn Asp Ala Gly Met Asp Ala Val Gin Arg Arg Leu Met Phe Glu Asp 
35 40 45 

Glu Cys He Leu Val Asp Glu Asn Asn Arg Val Val Gly His Asp Thr 
50 55 60 

Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Ala Glu Asn Leu Leu 
65 70 75 80 

His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu Leu 
85 90 95 

Leu Gin Gin Arg Ser Lys Thr Lys Val Thr Phe Pro Leu Val Trp Thr 
100 105 110 

Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu He Glu 
115 120 125 

Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu Phe Asp 
130 135 140 

Glu Leu Gly He Val Ala Glu Asp Val Pro Val Asp Glu Phe Thr Pro 
145 150 155 * 160 

Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly Glu 
165 170 ^ 175 

His Glu Val Asp Tyr Leu Leu Phe He Val Arg Asp Val Lys Leu Gin 
180 185 190 
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Pro Asn Pro Asp "Glu Val Ala Glu lie Lys Tyr Val Ser Arg Glu Glu 
195 200 205 

Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp Glu Ala Val Lys 
210 1 215 220 

Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Met Lys Trp 
225 230 235 240 

Trp Asp His Val Glu Lys Gly Thr He Thr Glu Ala Ala Asp Met Lys 
245 250 255 

Thr He His Lys Leu 
260 

<210> 19 
<211> 288 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 19 

Met Thr Ala Asp Asn Asn Ser Met Pro His Gly Ala Val Ser Ser Tyr 
1 5 10 15 

Ala Lys Leu Val Gin Asn Gin Thr Pro Glu Asp He Leu Glu Glu Phe 
20 25 30 

Pro Glu He He Pro Leu Gin Gin Arg Pro Asn Thr Arg Ser Ser Glu 
35 40 45 

Thr Ser Asn Asp Glu Ser Gly Glu Thr Cys Phe Ser Gly His Asp Glu 
50 55 60 

Glu Gin He Lys Leu Met Asn Glu Asn Cys He Val Leu Asp Trp Asp 
65 10 lb 80 

Asp Asn Ala He Gly Ala Gly Thr Lys Lys Val Cys His Leu Met Glu 
85 90 95 

Asn He Glu Lys Gly Leu Leu His Arg Ala Phe Ser Val Phe He Phe 
100 105 110 

Asn Glu Gin Gly Glu Leu Leu Leu Gin Gin Arg Ala Thr Glu Lys He 
115 120 ~ 125 

Thr Phe Pro Asp Leu Trp Thr Asn Thr Cys Cys Ser His Pro Leu Cys 
130 135 140 

He Asp Asp Glu Leu Gly Leu Lys Gly Lys Leu Asp Asp Lys He Lys 
145 150 155 * * 160 

Gly Ala He Thr Ala Ala Val Arg Lys Leu Asp His Glu Leu Gly He 
165 170 175 

Pro Glu Asp Glu Thr Lys Thr Arg Gly Lys Phe His Phe Leu Asn Arg 
180 185 190 

He His Tyr Met Ala Pro Ser Asn Glu Pro Trp Gly Glu His Glu He 
195 200 205 

Asp Tyr He Leu Phe Tyr Lys He Asn Ala Lys Glu Asn Leu Thr Val 
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Asn Pro Asn Val Asn Glu Val Arg Asp Phe, Lys Trp Val Ser Pro Asn 
225 230 235 240 

Asp Leu Lys Thr Met Phe Ala Asp Pro Ser Tyr Lys Phe Thr Pro Trp 
24 5 250 255 

Phe Lys He He Cys Glu Asn Tyr Leu Phe Asn Trp Trp Glu Gin Leu 
260 265 270 

Asp Asp Leu Ser Glu Val Glu Asn Asp Arg Gin He His Arq Met Leu 
275 280 285 

<210> 20 
<211> 456 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Consensus 
sequence of four plant B-cyclases 

<400> 20 

Met Asp Thr Leu Leu Lys Thr Pro Asn Leu Glu Phe Leu Pro His Glv 
1 5 10 15 

Phe Val Lys Ser Phe Ser Lys Phe Gly Lys Cys Glu Gly Val Cys Val 
20 25 30 

Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys Glu Asn 
35 40 45 

Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val Val Asd 
50 55 60 

Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val Ala Gin Gin 
65 70 75 80' 

Val Ser Glu Ala Gly Leu Ser Val Cys Ser He Asp Pro Pro Lys Leu 
85 90 * 95 

He Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met 
100 105 HO 

Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly Ala Val Tyr He 
115 120 125 

Asp Asp Thr Lys Asp Leu Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin 
130 135 140 

Leu Lys Ser Lys Met Met Gin Lys Cys He Asn Gly Val Lys Phe His 
145 150 155 ~ 160 

Gin Ala Lys Val He Lys Val He His Glu Glu Lys Ser Met Leu lie 
165 170 175 

Cys Asn Asp Gly Thr He Gin Ala Thr Val Val Leu Asp Ala Thr Gly 
180 185 190 

Phe Ser Arg Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin 
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195 200 205 

Val Ala Tyr Gly He Leu Ala Glu Val Glu Glu His Pro Phe Asp Lys ' 
210 215 220 

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Asn Glu Leu Lys 
225 230 235 240 

Glu Arg Asn Ser He Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser 
245 250 255 

Asn Arg He Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly Leu 
260 265 270 

Arg Met Asp Asp He Gin Glu Arg Met Val Ala Arg Leu His Leu Gly 
275 280 285 

He Lys Val Lys Ser He Glu Glu Asp Glu His Cys Val He Pro Met 
290 295 300 

Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val Gly He Gly Gly 
305 310 315 320 

Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr 
325 330 '* 335 

Leu Ala Ala Ala Pro Val Val Ala Asn Ala He He Tyr Leu Gly Ser 
340 345 350 

Glu Ser Ser Gly Glu Leu Ser Ala Glu Val Trp Lys Asp Leu Trp Pro 
355 360 ~ 365 

lie Glu Arg Arg Arg Gin Arg Glu Phe Phe Cys Phe Gly Met Asp He 
370 375 380 

Leu Leu Lys Leu Asp Leu Pro Ala Thr Arg Arg Phe Phe Asp Ala Phe 
385 390 395 400 

Phe Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu 
405 410 415 

Phe Leu Pro Glu Leu He Val Phe Gly Leu Ser Leu Phe Ser His Ala 
420 425 430 

Ser Asn Thr Ser Arg Glu He Met Thr Lys Gly Thr Pro Leu Val Met 
435 440 445 

He Asn Asn Leu Leu Gin Asp Glu 
450 455 

<210> 21 

<211> 524 

<212> PRT 

<213> Arabidopsis thaliana 

<400> 21 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 
1 5 10 15 

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 
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Ser Tyr Arg Asn 
35 

Gly Gly Ser Ser 
50 

Ala Asp Glu Glu 
65 

Val Gin Met Gin 



Asp Lys Leu Pro 
100 

Val lie Gly Cys 
115 

Lys Leu Gly Leu 
130 

Asn Asn Tyr Gly 
145 



Lys Cys lie Glu 



Asp Lys Pro lie 
180 

Leu Leu His Glu 
195 

Tyr Leu Ser Ser 
210 

Arg Leu Val Ala 
225 



Thr Val Ala Ser 



Gly Gly Pro Arg 
260 

Val Glu Asn Ser 
275 

Arg Asp Tyr Thr 
290 

Thr Phe Leu Tyr 
305 



Glu Thr Cys Leu 



Thr Lys Leu Met 
340 

Thr Tyr Glu Glu 



lie Arg Phe Gly 
40 

Gly Ser Glu Ser 
55 

Asp Phe Val Lys 
70 

Gin Asn Lys Asp 
85 

Pro lie Ser lie 



Gly Pro Ala Gly 
120 



Lys Val Gly Leu 
135 



Val Trp Glu Asp 
150 



His Val Trp Arg 
165 

Thr lie Gly Arg 



Glu Leu Leu Arg 
200 

Lys Val Asp Ser 
215 



Cys Asp Asp Asn 
230 

Gly Ala Ala Ser 
245 

Val Cys Val Gin 



Pro Tyr Asp Pro 
280 

Asn Glu Lys Val 
295 



Ala Met Pro Met 
310 

Ala Ser Lys Asp 
325 

Leu Arg Leu Asp 



Glu Trp Ser Tyr 



Leu Cys Ser Val 



Cys Val Ala Val 
60 



Ala Gly Gly Ser 
75 

Met Asp Glu Gin 
90 

Gly Asp Gly Ala 
105 

Leu Ala Leu Ala 



lie Gly Pro Asp 
140 

Glu Phe Asn Asp 
155 



Glu Thr He Val 
170 

Ala Tyr Gly Arg 
185 

Arg Cys Val Glu 



He Thr Glu Ala 
220 

Asn Val He Pro 
235 



Gly Lys Leu Leu 
250 

Thr Ala Tyr Gly 
265 

Asp Gin Met Val 



Arg Ser Leu Glu 
300 

Thr Lys Ser Arg 
315 

Val Met Pro Phe 
330 

Thr Leu Gly lie 
345 

He Pro Val Gly 



Arg Ala Ser Gly 
4 5 



Arg Glu Asp Phe 



Glu He Leu Phe 
80 

Ser Lys Leu Val 
95 



Leu Asp His Val 
110 



Ala Glu Ser Ala 
125 



Leu Pro Phe Thr 



Leu Gly Leu Gin 
160 

Tyr Leu Asp Asp 
175 

Val Ser Arg Arg 
190 

Ser Gly Val Ser 
205 



Ser Asp Gly Leu 



Cys Arg Leu Ala 
240 

Gin Tyr Glu Val 
255 

Val Glu Val Glu 
270 



Phe Met Asp Tyr 
28 5 

Ala Glu Tyr Pro 



Leu Phe Phe Glu 
320 

Asp Leu Leu Lys 
335 

Arg He Leu Lys 
350 



Gly Ser Leu Pro 



2-1 
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355 360 365 

Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly,.Ala Ala Ala Ser Met Val 
370 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lvs Tyr Ala Ser Val He Ala Glu He Leu Arg Glu Glu Thr Thr Lys 
405 410 415 

Gin He Asn Ser Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro 
420 425 430 

Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 445 

He Val Gin Phe Asp Thr Glu Gly He Arg Ser Phe Phe Arg Thr Phe 
450 455 460 

Phe Arq Leu Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val He Ser 
485 490 495 

Pro Asn Asn Leu Arg Lys Gly Leu He Asn His Leu lie Ser Asp Pro 
500 505 510 

Thr Gly Ala Thr Met He Lys Thr Tyr Leu Lys Val 
515 520 

<210> 22 
<211> 1898 
<212> DNA 

<213> Adonis palaestina 

AAAGGAGTGT TCTATTAATG TTACTGTCGC ATTCTTGCAA CACTTATATT CAAACTCCAT 60 

TTTCTTCTTT TCTCTTCAAA ACAACAAACT AATGTGAGCA GAGTATCTGG CTATGGAACT 120 

ACTTGGTGTT CGCAACCTCA TCTCTTCTTG CCCTGTGTGG ACTTTTGGAA CAAGAAACCT 180 

TAGTAGTTCA AAACTAGCTT ATAACATACA TCGATATGGT TCTTCTTGTA GAGTAGATTT 24 0 

TCAAGTGAGA GCTGATGGTG GAAGCGGGAG TAGAAGTTCT GTTGCTTATA AAGAGGGTTT 300 

TGTGGATGAA GAGGATTTTA TCAAAGCTGG TGGTTCTGAG CTTTTGTTTG TCCAAATGCA 360 

GCAAACAAAG TCTATGGAGA AACAGGCCAA GCTCGCCGAT AAGTTGCCAC CAATACCTTT 4 20 

TGGAGAATCC GTGATGGACT TGGTTGTAAT AGGTTGTGGA CCTGCTGGTC TTTCACTGGC 4 80 

TGCAGAAGCT GCTAAGCTAG GGTTGAAAGT TGGCCTTATT GGTCCTGATC TTCCTTTTAC 54 0 

AAATAATTAT GGTGTGTGGG AAGACGAGTT CAAAGATCTT GGACTTGAAC GTTGTATCGA 600 

GCATGCTTGG AAGGACACCA TCGTATATCT TGATAATGAT GCTCCTGTCC TTATTGGTCG 660 

TGCATATGGA CGAGTTAGTC GACATTTGCT ACATGAGGAG TTGCTGAAAA GGTGTGTGGA 720 
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GTCAGGTGTA TCATATCTTG ATTCTAAAGT GGAAAGGATC ACTGAAGCTG GTGATGGCCA 7 80 

TAGCCTTGTA GTTTGTGAAA ATGAGATCTT TATCCCTTGC AGGCTTGCTA CTGTTGCATC 840 

TGGAGCAGCT TCAGGGAAAC TTTTGGAGTA TGAAGTAGGT GGCCCTCGTG TTTGTGTCCA 900 

AACCGCTTAT GGGGTGGAGG TTGAGGTGGA GAACAATCCA TACGATCCCA ACTTAATGGT 960 

ATTCATGGAC TACAGAGACT ATATGCAACA GAAATTACAG TGCTCGGAAG AAGAATATCC 1020 

AACATTTCTC TATGTCATGC CCATGTCGCC AACAAGACTT TTTTTTGAGG AAACCTGTTT 108 0 

GGCCTCAAAA GATGCCATGC CATTCGATCT ACTGAAGAGA AAACTGATGT CACGATTGAA 114 0 

GACTCTGGGT ATCCAAGTTA CAAAAGTTTA TGAAGAGGAA TGGTCATATA TTCCTGTTGG 1200 

TGGTTCTTTA CCAAACACAG AGCAAAAGAA CCTAGCATTT GGTGCTGCAG CAAGCATGGT 12 60 

GCATCCAGCA ACAGGCTATT CGGTTGTACG GTCACTGTCA GAAGCTCCAA AATATGCTTC 132 0 

TGTAATTGCA AAGATTTTGA AG C AAG AT AA CTCTGCGTAT GTGGTTTCTG GACAAAGTAG 1380 

TGCAGTAAAC ATTTCAATGC AAGCATGGAG CAGTCTTTGG CCAAAGGAGC GAAAACGTCA 14 4 0 

AAGAGCATTC TTTCTTTTTG GATTAGAGCT TATTGTGCAG CTAGATATTG AAGCAACCAG 1500 

AACATTCTTT AGAACCTTCT TCCGCTTGCC AACTTGGATG TGGTGGGGTT TCCTTGGGTC 15 60 

TTCACTATCA TCTTTCGATC TCGTCTTGTT TTCCATGTAC ATGTTTGTTT TGGCGCCAAA 1620 

CAGCATGAGG ATGTCACTTG TGAGACATTT GCTTTCAGAT CCTTCTGGTG CAGTTATGGT 1680 

AAGAGCTTAC CTCGAAAGGT AGTCTCATCT ATTATTAAAC TCTAGTGTTT CACCAAATAA 17 4 0 

ATGAGGATCC TTCGAATGTG TATATGATCA TCTCTATGTA TATCCTGTAC TCTAATCTCA 1800 

TAAAGTAAAT GCCGGGTTTG ATATTGTTGT GTCAAACCGG CCAATGATAT AAAGTAAATT 18 60 

TATTGATACA AAAGTAGTTT TTTTCCTTAA AAAAAAAA 18 98 

<210> 23 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 23 

Met Glu Leu Leu Gly Val Arg Asn Leu lie Ser Ser Cys Pro Val Trp 
1 5 10 15 

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn lie 
20 25 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 
35 " 40 45 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe lie Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 
65 70 75 80 

£3 
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Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
85 90 " 95 

Lys Leu Pro Pro He Pro Phe Gly Glu Serial Met Asp Leu Val Val 
100 105 HO 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arq 
14 5 150 155 ' " 160 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu lie Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asp Ser Lys Val Glu Arg lie Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 

Leu Val Val Cys Glu Asn Glu lie Phe lie Pro Cys Arg Leu Ala Thr 
225 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 * 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys Val 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
38 5 390 395 400 

Tyr Ala Ser Val He Ala Lys lie Leu Lys Gin Asp Asn Ser Ala Tyr 



2* 
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405 410 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn lie Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu lie Val Gin Leu Asp lie Glu Ala Thr Arg Thr 
450 455 * 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe • 
465 470 475 * 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu 
515 520 525 

Arg 

<210> 24 
<211> 1370 
<212> DNA 
<213> Potato 

<400> 24 



TAGCGGAGGA 


TGAGTTCAAA 


GATCTTGGTC 


TTCAAGCCTG 


CATTGAACAT 


GTTTGGCTGG 


60 


GATACCATTG 


TATATCTTGA 


TGATGATGAT 


CCTATTCTTA 


TTGGCCGTGC 


CTATGGAAGA 


120 


GTTAGTCGCC 


ATTTACTGCA 


CGAGGAGTTA 


CTCAAAAGGT 


GTGTGGAGGC 


AGGTGTTTTG 


180 


TATCTAAACT 


CGAAAGTGGA 


TAGGATTGTT 


GAGGCCACAA 


ATGGCCACAG 


TCTTGTAGAG 


240 


TGCGAGGGTG 


ATGTTGTGAT 


TCCCTGCAGG 


TTTGTGACTG 


TTGCATCGGG 


AGCAGCCTCG 


300 


GGGAAATTCT 


TGCAGTATGA 


GTTGGGAGGT 


CCTAGAGTTT 


CTGTTCAAAC 


AGCTTATGGA 


360 


GTGGAAGTTG 


AGGTCGATAA 


CAATCCATTT 


GACCCGAGCC 


TGATGGTTTT 


CATGGATTAT 


420 


AGAGACTATG 


TCAGACACGA 


CGCTCAATCT 


TTAGAAGCTA 


AATATCCAAC 


ATTTCTCTAT 


480 


GCCATGCCCA 


TGTCTCCAAC 


ACGAGTCTTT 


TTCGAGGAAA 


CTTGTTTGGC 


TTCAAAAGAT 


540 


GCAATGCCAT 


TCGATCTGTT 


AAAGAAAAAA 


TTGATGTTAC 


GATTGAACAC 


CCTCGGTGTA 


600 


AGAATTAAAG 


AAATTTATGA 


GGAGGAATGG 


TCTTACATAC 


CAGTTGGAGG 


ATCTTTGCCA 


660 


AATACAGAAC 


AAAAAACACT 


TGCATTTGGT 


GCTGCTGCTA 


GCATGGTTCA 


TCCAGCCACA 


720 


GGTTATTCAG 


TCGTCAGATC 


ACTGTCTGAA 


GCTCCAAAAT 


GCGCCTTCGT 


GCTTGCAAAT 


780 


ATATTACGAC 


AAAAT CAT AG 


CAAGAATATG 


CTTACTAGTT 


CAAGTACCCC 


GAGTATTTCA 


840 


ACTCAAGCTT 


GGAACACTCT 


TTGGCCACAA 


GAACGAAAAC 


GACAAAGATC 


GTTTTTCCTA 


900 



BNSDOOID: <WO AflRaftRSAi IA> 



WO 99/63055 PCT/US 99/1 2121 

TTTGGACTGG CTCTGATATT GCAGCTGGAT ATTGAGGGGA TAAGGTCATT TTTCCGCGCG 960 

TTCTTCCGTG TGCCAAAATG GATGTGGCAG GGATTTCTTG GTTCAAGTCT TTCTTAGCAG 102 0 

ACCTCATGTT ATTTGCCTTC TACATGTTTA TTATTGCACC AAATGACATG AGAAGAGGCT 108 0 

TAATCAGACA TCTTTTATCT GATCCTACTG GTGCAACATT GATAAGAACT TATCTTACAT 1140 

TTTAGAGTAA ATTCCTCCTA CAATAGTTGT TGAAAGAGGC CTCATTACTT CAGATTCATA 1200 

ACAGAAATCG CGGTCTCTCG AGGCCTTGTA TATAACATTT TCACTAGGTT AATATTGCTT 12 60 

GAATAAGTTG CACAGTTTCA GTTTTTGTAT CTGCTTCTTT TTTGTCCAAG ATCATGTATT 1320 

GACCAATTTA TATACATTGC CAGTATATAT AAATTTTATA AAAAAAAAAA 1370 

<210> 25 
<211> 377 
<212> PRT 
<213> Potato 

<400> 25 

Asp Glu Phe Lys Asp Leu Gly Leu Gin Ala Cys He Glu His Val Trp 
1 5 10 15 

Arg Asp Thr He Val Tyr Leu Asp Asp Asp Asp Pro He Leu He Gly 
20 25 30 

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu 
35 40 45 

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp 
50 55 60 

Arg He Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly 
65 70 75 80 

Asp Val Val He Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala 
85 90 95 

Ser Gly Lys Phe Leu Gin Tyr Glu Leu Gly Gly Pro Arg Val Ser Val 
100 105 110 

Gin Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp 
115 120 125 

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp 
130 135 140 

Ala Gin Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro 
145 150 155 160 

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys 
165 170 175 

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu 
180 185 190 

Asn Thr Leu Gly Val Arg He Lys Glu He Tyr Glu Glu Glu Trp Ser 
195 200 205 



7X 



BNSDOCID: <WO 9963055A1 IA> 



• 



WO 99/63055 ^^ PCT/US99/12121 

Tyr lie Pro ValGly Gly Ser Leu Pro Asn Thr Glu Gin Lys Thr Leu 
210 215 220 

Ala Phe Gly Ala Ala, Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser 
225 230 235 240 

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala 
245 250 255 

Asn He Leu Arg Gin Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser 
260 265 270 

Thr Pro Ser He Ser Thr Gin Ala Trp Asn Thr Leu Trp Pro Gin Glu 
275 280 285 

Arg Lys Arg Gin Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu He Leu 
290 295 300 

Gin Leu Asp He Glu Gly lie Arg Ser Phe Phe Arg Ala Phe Phe Ara 
305 310 315 ~ 320 

Val Pro Lys Met Met Trp Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala 
325 330 335 

Asp Leu Met Leu Phe Ala Phe Tyr Met Phe He lie Ala Pro Asn Asp 
340 345 350 

Met Arg Arg Gly Leu lie Arg His Leu Leu Ser Asp Pro Thr Gly Ala 
355 360 365 

Thr Leu He Arg Thr Tyr Leu Thr Phe 
370 375 

<210> 26 
<211> 533 
<212> PRT 

<213> Chimeric lettuce/potato 
<400> 26 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe 
15 10 is 

Thr Cys Pro Arg Phe Thr Asp Cys Asn lie Arg His Lys Phe Ser Leu 
2 0 25 30 

Leu Lys Gly Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Ara 
35 .40 45 

Gin lie Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lvs Gin 
50 55 60 

Gly lie Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser 
65 70 75 " so 

Glu Leu Phe Phe Val Gin Met Gin Arg Thr Lys Ser Met Glu Ser Gin 
85 90 95 

Ser Lys Leu Ser Glu Lys Leu Ala Gin lie Pro lie Gly Asn Cys lie 
100 105 no 

Leu Asp Leu Val Val lie Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala 



BNSDOCID: <WO 9963055Al_IA> 
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115 120 125 

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu He Gly. Pro Asd' 
130 135 140 

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gin . Asp Glu Phe He Glv 
145 150 155 * 160 

Leu Gly Leu Glu Gly Cys He Glu His Ser Trp Lys Asp Thr Leu Val 
165 170 175 

Tyr Leu Asp Asp Ala Asp Pro He Arg He Gly Arg Ala Tyr Gly Ara 
180 185 190 • 

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu 
195 200 205 

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg He Thr Glu Ala 
210 215 220 

Pro Asn Gly Tyr Ser Leu He Glu Cys Glu Gly Asn He Thr He Pro 
225 230 235 240 

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu 
245 250 255 

Glu Tyr Glu Leu Gly Gly Pro Arg Val Ser Val Gin Thr Ala Tyr Gly 
260 265 . 270 

Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp Pro Ser Leu Met Val 
275 280 285 

Phe Met Asp Tyr Arg Asp Tyr Val Arg His .Asp Ala Gin Ser Leu Glu 
290 295 300 

Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro Met Ser Pro Thr Arq 
305 310 315 320 

Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe 
325 330 335 

Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu Asn Thr Leu Gly Val 
340 345 350 

Arg He Lys Glu He Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly 
355 360 365 

Gly Ser Leu Pro Asn Thr Glu Gin Lys Thr Leu Ala Phe Gly Ala Ala 
370 375 380 

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu 
385 390 395 ~ 400 

Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala Asn He Leu Arg Gin 
405 410 415 

Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser Thr Pro Ser He Ser 
420 425 430 

Thr Gin Ala Trp Asn Thr Leu Trp Pro Gin Glu Arg Lys Arg Gin Arq 
435 440 445 



2* 
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Ser Phe Phe Leu Phe Gly Leu Ala Leu He Leu Gin Leu Asp lie Glu 
450 455 460 

Gly He Arg Ser Phe Phe Arg Ala Phe Phe Arg Val Pro Lys Trp Met 
465 470 475 4 80 

Trp Gin Gly Phe Leu Gly Ser Ser Leu Ser Xaa Ala Asp Leu Met Leu 
485 490 495 

Phe Ala Phe Tyr Met Phe He He Ala Pro Asn Asp Met Arg Arg Gly 
500 505 510 

Leu He Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Leu He Arg 
515 520 525 

Thr Tyr Leu Thr Phe 
530 

<210> 27 
<211> 374 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 27 

Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin Lys Cys He Glu His Val 
1 5 10 15 

Trp Arg Glu Thr He Val Tyr Leu Asp Asp Asp Lys Pro lie Thr He 
20 25 " 30 

Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg Leu Leu His Glu Glu Leu 
35 40 45 

Leu Arg Arg Cys Val Glu Ser Gly Val Ser Tyr Leu Ser Ser Lys Val 
50 55 60 

Asp Ser He Thr Glu Ala Ser Asp Gly Leu Arg Leu Val Ala Cys Asp 
65 70 75 " 80 

Asp Asn Asn Val lie Pro Cys Arg Leu Ala Thr Val Ala Ser Gly Ala 
85 90 95 

Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val Gly Gly Pro Arg Val Cys 
100 105 J no 

Val Gin Thr Ala Tyr Gly Val Glu Val -Glu Val Glu Asn Ser Pro Tyr 
115 120 125 

Asp Pro Asp Gin Met Val Phe Met Asp Tyr Arg Asp Tyr Thr Asn Glu 
130 135 140 

Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro Thr Phe Leu Tyr Ala Met 
145 150 155 ~ 160 

Pro Met Thr Lys Ser Arg Leu Phe Phe Glu Glu Thr Cys Leu Ala Ser 
165 170 175 

Lys Asp Val Met Pro Phe Asp Leu Leu Lys Thr Lys Leu Met Leu Arg 
180 185 " 190 

Leu Asp Thr Leu Gly He Arg He Leu Lys Thr Tyr Glu Glu Glu Trp 



BNSDOCID: <WO 9963055A1 IA> 
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195 200 205 

Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gin Lys Asn ' 
210 215 220 

Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr 
225 230 235 240 

Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Tyr Ala Ser Val He 
245 250 ~ 255 

Ala Glu He Leu Arg Glu Glu Thr Thr Lys Gin He Asn Ser Asn He 
260 265 270 

Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro Pro Glu Arg Lys Arg Gin 
275 280 285 

Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu He Val Gin Phe Asp Thr 
290 295 300 

Glu Gly He Arg Ser Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp 
305 310 315 320 

Met Trp Gin Gly Phe Leu Gly Ser Thr Leu Thr Ser Gly Asp Leu Val 
325 330 335 

Leu Phe Ala Leu Tyr Met Phe Val He Ser Pro Asn Asn Leu Arg Lys 
340 345 350 

Gly Leu He Asn His Leu He Ser Asp Pro Thr Gly Ala Thr Met He 
355 360 365 

Lys Thr Tyr Leu Lys Val 
370 

<210> 28 
<211> 1002 
<212> DNA 

<213> Adonis palaestina 
<400> 28 



ATTCATCTTC 


AGCAGCGCTG 


TCGTACTCTT 


TCTATATCTT 


CTTCCATCAC 


TAACAGTAGT 


60 


CGCCGACGGT 


TGAATCGGCT 


ATTCGCCTCA 


ACGTCAACTA 


TGGGTGAAGT 


CACTGATGCT 


120 


GGAATGGATG 


CTGTTCAGAA 


GCGGCTCATG 


TTCGACGACG 


AATGTATTTT 


GGTGGATGAG 


180 


AATGACAAGG 


T.CGTCGGGCA 


TGATTCCAAA 


TACAACTGTC 


ATTTGATGGA 


AAAGATAGAG 


240 


GCAGAAAATT 


TGCTTCACAG 


AGCCTTCAGT 


GTTTTCTTGT 


TCAACTCAAA 


ATATGAATTG 


300 


CTTCTTCAGC 


AACGATCCGC 


CACAAAGGTA 


ACATTCCCGC 


TCGTATGGAC 


AAAC AC AT G T 


360 


TGCAGTCATC 


CTCTCTTTCG 


TGATTCCGAG 


CTCATAGAAG 


AAAATTATCT 


CGGTGTACGA 


420 


AACGCTGCAC 


AAAGAAAGCT 


TTTAGACGAG 


CTAGGCATTC 


CAGCTGAAGA 


TGTCCCAGTT 


480 


GATGAATTTA 


CTCCTCTTGG 


TCGCATTCTT 


TACAAAGCTC 


CATCTGACGG 


CAAATGGGGA 


540 


GAGCACGAAT 


TGGACTATCT 


CCTATTTATT 


GTCCGAGATG 


TGAAATACGA 


TCCAAACCCA 


600 



2o 
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GATGAAGTTG 


CTGATGCTAA 


GTATGTTAAT 


CGCGAGGAGT 


TGAGAGAGAT 


AC T G AG AAAA 


660 


GCTGATGCTG 


GTGAAGAGGG 


ACTCAAGTTG 


TCTCCTTGGT 


TTAGATTGGT 


TGTTGATAAC 


720 


TTTTTGTTCA 


AGTGGTGGGA 


TCATGTAGAG 


CAGGGTACGA 


TTAAGGAAGT 


TGCTGACATG 


780 


AAAACTATCC 


ACAAGTTGAC 


TTAAGAGGAC 


TTCTCTCCTC 


TGTTCTACTA 


TTTGTTTTTT 


840 


GCTACAATAA 


GTGGGTGGTG 


ATAAGCAGTT 


TTTCTGTTTT 


CTTTAATTTA 


TGGCTTTTGA 


900 


ATTTGCCTCG 


ATGTTGAACT 


TGTAACATAT 


TTAGACAAAT 


ATGAGACCTT 


GTAAGTTGAA 


960 


TTTGAGGCTG 


AATTTATATT 


TTTGGGAACA 


TAATAATGTT 


AA 




1002 


<210> 29 
<211> 1271 
<212> DNA 

<213> Adonis palaestina 










<400> 29 
TTTTAAAGCT 


CTTTCGCTCC 


ACCACCATCA 


AAGCCAGCCA 


AATTTCTCTG 


TACAAAAGTT 


60 


AAAAACACCG 


CTTTGGGCTT 


TGGCCCCTCC 


ATATCGGAAT 


CCTTGTTTAC 


G AT ACGC AT C 


120 


TAAACCAGTA 


ATTCTCGGTT 


TTAATTTGTT 


TCCTAAATTA 


GGCCCCTTTC 


CGGAATCCCG 


180 


AGAATTATGT 


CGTCGATCAG 


GATTAATCCT 


TTATATAGTA 


TCTTCTCCAC 


CACCACTAAA 


240 


ACATTATCAG 


CTTCGTGTTC 


TTCTCCCGCT 


GTTCATCTTC 


AGCAGCGTTG 


TACGTACTCT 


300 


TTCTATTTCT 


TCTTCCATCA 


CTAACAGTCC 


TCGCCGAGGG 


TTGAATCGGC 


TGTTCGCCTC 


360 


AACGTCGACT 


ATGGGTGAAG 


TCGCTGATGC 


TGGTATGGAT 


GCCGTCCAGA 


AGCGGCTTAT 


420 


GTTCGACGAT 


GAATGTATTT 


TGGTGGATGA 


GAATGACAAG 


GTCGTCGGAC 


ATGATTCCAA 


480 


ATACAACTGT 


CATTTGATGG 


AAAAGATAGA 


GGCAGAAAAC 


TTGCTTCACA 


GAGCCTTCAG 


540 


TGTTTTCTTA 


T.TCAACTCAA 


AATACGAGTT 


GCTTCTTCAG 


CAACGATCTG 


CAACGAAGGT 


600 


AACATTCCCG 


CTCGTATGGA 


CAAACACCTG 


TTGCAGCCAT 


CCCCTCTTCC 


GTGATTCCGA 


660 


ACTCATAGAA 


GAAAATTTTC 


TCGGGGTACG 


AAACGCTGCA 


CAAAGGAAGC 


TTTTAGACGA 


720 


GCTAGGCATT 


CCAGCTGAAG 


ACGTACCAGT 


TGATGAATTC 


ACTCCTCTTG 


GTC^GCATTCT 


780 


TTACAAAGCT 


CCATCTGACG 


GAAAATGGGG 


AGAGCACGAA 


CTGGACTATC 


TTCTGTTTAT 


84 0 


TGTCCGAGAT 


GTGAAATACG 


ATCCAAACCC 


AGATGAAGTT 


GCTGACGCTA 


AGTACGTTAA 


900 


TCGCGAGGAG 


TTGAAAGAGA 


TACTGAGAAA 


AGCTGATGCA 


GGTGAAGAGG 


GAATAAAGTT 


960 


GTCTCCTTGG 


TTTAGATTGG 


TTGTGGATAA 


CTTTTTGTTC 


AAGTGGTGGG 


ATCATGTAGA 


1020 


GGAGGGGAAG 


ATTAAGGACG 


TCGCCGACAT 


GAAAACTATC 


CACAAGTTGA 


CTTAAGAGAA 


1080 


AGTCTCTTAA 


GTTCTACTAT 


TTGGTTTTTG 


CTTCAATAAG 


TGGATGGTGA 


TGAGCAGTTT 


1140 


TTATGCTTCC 


TTTAATTTTG 


GCTTTTCAAT 


TTGCTTTATG 


TGTTGAACTT 


GTAACATATT 


1200 


TAGTCAAATA 


TGAGACCTTG 


TGAGTTGAAT 


TTGAGGTTAT 


ATTTATAGTT 


TTGGGAACAT 


1260 



BNSDOCID: <WO 99630SSA 1 IA> 
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AAAAAAAAAA A 

<210> 30 
<211> 1109 
<212> DNA 

<213> Haematococcus pluvialis 



<400> 30 



TGGAACCTGG 


CCCGGCGGCA 


GTCCGATGCC 


GCGATGCTTC 


GTTCGTTGPT 




DU 


ACGCATATCC 


CGCGCGTGAA 


CTCCGCCCAG 


CAGCCCAGCT 


GTGCACACGC 


GCGAPTPPAP. 


J. ^/ 


TTTAAGCTCA 


GGAGCATGCA 


GCTGCTTGCC 


GAGGACCGCA 


CAGACCACAT 


GAGGGGTGPA 


180 


AGCACCTGGG 


CAGGCGGGCA 


GTCGCAGGAT 


GAGCTGATGC 


TGAAGGACGA 


GTGCATCTTA 




GTGGATGCTG 


ACGACAACAT 


CACAGGCCAT 


GCCAGCAAGC 


TGGAGTGCCA 


CAAATTPPTA 


j \j \j 


CCACATCAGC 


CTGCAGGCCT 


GCTGCACCGG 


GCCTTCTCTG 


TGTTCCTGTT 


TGAPGAPPAP 


? fin 

«J o u 


GGGCGACTGC 


TGCTGCAACA 


GCGTGCACGC 


TCAAAAATCA 


CCTTCCCAAG 


TGTGTPPAPP 


*4 U 


AACACCTGCT 


GCAGCCACCC 


TCTACATGGG 


CAGACCCCAG 


ATGAGGTGP^A 


PPA Z\PTZ\ APP 


il Q C\ 
*i O U 


CAGGTGGCCG 


ACGGCACAGT 


ACCTGGCGCA 


AAAGCTGCTG 








GAGCTGGGGA 


TACCAGCGCA 


CCAGCTGCCG 


GPAAGPGPGT 


x X VjO'o 1 1 x 


P ft P P pr T T 1 T C 


oUU 


CACTACTGTG 


CCGCGGACGT 


GCAGCCGGCT 


GCGACACAAT 


pagpgptptp. 




^ a c\ 
bbU 


GAGATGGACT 


ACATCTTATT 

iivil X X X £x X X 




rlriL- \J x K^rW.,^ x 


tpppppppzv n 


bbb i. VjACCaAb 


~i *~i r\ 


GTGGACGAAG 


TCAGGTACGT 


GACGCAAGAG 


GAGCTGCGGC 


AG AT GAT G C A 


GCCGGACAAC 


780 


GGGTTGCAAT 


GGTCGCCGTG 


GTTTCGCATC 


ATCGCCGCGC 


GCTTCCTTGA 


GCGTTGGTGG 


84 0 


GCTGACCTGG 


ACGCGGCCCT 


AAACACTGAC 


AAAC AC GAG G 


ATTGGGGAAC 


GGTGCATCAC 


900 


ATCAACGAAG 


CGTGAAGGCA 


GAAGCTGCAG 


GATGTGAAGA 


CACGTCATGG 


GGTGPAATTP 


o u 


CGTACTTGGC 


AGCTTCGTAT 


CTCCTTTTTC 


TGAGACTGAA 


CCTGCAGAGC 


TAGAGTPAAT 




GGTGCATCAT 


ATTCATCGTC 


TCTCTTTTGT 


TTTAGACTAA 


TCTGTAGCTA 


GAGTCACTGA 


1080 


TGAATCCTTT 


ACAACTTTCA 


AAAAAAAAA 








1109 


<210> 31 
<211> 985 
<212> DNA 

<213> Lactuca sativa 












<400> 31 
TGCCAAAATG 


TTGAAATTTC 


CCCCTTTTAA 


AACCATTGCT 


ACCATGATCT 


CTTCTCCATA 


60 


TTCTTCCTTC 


TTGCTGCCTC 


GGAAATCTTC 


TTTCCCTCCA 


ATGCCGTCTC 


TCGCAGCCGC 


120 


TAGTGTTTTC 


CTCCACCCTC 


TTTCGTCTGC 


CGCTATGGGC 


GATTCCAGCA 


TGGATGCTGT 


180 


CCAGCGACGT 


CTCATGTTCG 


ATGACGAATG 


CATTTTGGTG 


GATGAGAATG 


ACAAAGTGGT 


240 


TGGCCATGAT 


ACTAAATACA 


ATTGTCATTT 


GATGGAGAAG 


ATTGAAAAGG 


GAAATATGCT 


300 



BNISDOCID: <WO 9963055A1_IA> 
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ACACAGAPPA 


X 1 ^MO i \J 1 O 1 


1 v_ l 1 o 1 1 L-M-H 


TTCTGPAAPP 


AAPPTPAPTT 


A v_,V_ 111 \j \j 1 


ATAPAPPPAP 


aptp apptt a 






PATPA APTPP 




ATTGGGTPPP 


ATTPTATAPA 


nuo^\*uL.n 1 \— 


TTACCTGCTG 


TTTATGGTAC 


GTGATGTTGP 


TGTAAAATAT 


GTGAACCGGG 


AAGAGCTGAA 


AGAGGGTGTG 


AAGCTGTCCC 


CGTGGTTCAA 


GTGGGATCGA 


PTPCATAAGG 


PAAPPPTAAP 


ACTCACATAA 


AAACACTACA 


CTAGTAGGAG 


GAAATTGAAA 


TTCAGATGAA 


TGCTTGTATT 


TTTGCTACCT 


TATCAGAAAA 


AAAAA 


<210> 32 
<211> 988 
<212> DNA 

<213> Lactuca sativa 




<400> 32 
TATTCGCTTC 


AAAATCTCTT 


PP ATT A APTP 


TCCGCCGGCG 


CACTTTCACC 


A PP AT A arrr 


CTGTCCAGAG 


ACGTCTCATG 


ill un 1 1 o 


TTCTTGGGCA 


TGATACCAAA 


T APA ATTPTP 


TGCTTCATAG 


AGCATTCAGT 


P TATTTTT AT* 
v3 X r\ X l i i i/ii 


AAAGGTCAGA 


AACCAAGGTG 


APATTTPPTT 


CACTATACAG 


AGAATCGGAG 


TTAATTCCCG 

x x rmi x v^vvvj 


AGAGGAAGCT 


TCTAGATGAA 


CTCGGTATPC 


CAACTTTAGG 


TCGCATGTTG 


TACAAGGPTP 


TTGATTACCT 


ACTCTTCCTC 


GTGCGTGACG 

VJ X VJuU X VJiiVVJ 


CGGACATTAG 


ATACGTGAAC 


PAAGAAGAPT 


GTGAGGAGGG 


TTTGAAATTG 


TCCCCATGGT 


AATGGTGGGA 


TCATGTCCAA 


AAGGGGACAC 


ATAAGTTGAT 


ATGAAAAATG 


GTTAATATTT 


TGTTCAAGTC 


TCGGTCCTTC 


TTTTTTTAAC 


TATTGTGTAC 


TTGTAACGTA 


GGCCCTTTGG 



BNSDOCID: <WO 9963055A1_IA> 
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CTCGAAATAT 


GAATTACTCC 


TTCAGPAAPP 




ATGGACAAAC 


ACGTGTTGCA 


GCCATPPAPT 




CGCCCTTGGG 


GTGAGGAATG 


PTPPAPAPAP 


don 

4 OU 


AGCAGATGTT 


CCGGTTGATG 


APTTP APTPP 


D H U 


GGATGGAAAG 


TGGGGAGAAC 


ATGAACTTGA 


con 

vJ W VJ 


TTTGGATCCG 


AACCCAGATG 


AAGTGAAAGA 


660 


GGAATTGGTA 


AGGAAGGCGG 


ATGCTGGTGA 


720 


ATTGATTGTC 


GATAATTTCT 


TGTTTCAGTG 


780 


CGAAGCTATT 


GATATGAAAA 


CAATCCACAA 


840 


AGAGGATTAT 


ATGAGATATT 


TGTTATATGT 


900 


TATTTCTATT 


TGGACAAACT 


TCAACTTCTT 


960 








985 


CTCAAATCTC 


CACCTTCGCC 


GGTCTTAATP 


vj \J 


CCGCCATGGG 


TGACGATTCC 


GGPATPPAPP 




AATGCATTTT 


GGTTGATGAA 


AATGAPAATP 


1 Qf) 

± O VJ 


ACTTGATGGA 


GAAGATTGAG 


AAAGATAATT 


24 0 


TCAATTCAAA 


ATACGAATTA 


CTCCTTCAGC 


300 


TGGTATGGAC 


AAACACCTGT 


TGCAGCCATC 


360 


AAAATGCCCT 


TGGGGTCAGA 


AATGCTGCAC 


420 


CTGCTGAAGA 


TGTTCCAGTT 


GATGAGTTCA 


480 


CATCTGATGG 


AAAATGGGGT 


GAACATGAAG 


540 


TTGCCGTGAA 


CCCAAACCCT 


GATGAGGTGG 


600 


TAAAAGAGTT 


ACTAAGGAAG 


GCGGATGCGG 


660 


TTAGGCTAGT 


GGTGGACAAC 


TTCTTGTTCA 


720 


TCAATGAAGC 


AATTGACATG 


AAAACCATTC 


780 


ATGGTGGTGG 


TTTGGAGCTA 


ATAATTTGTG 


840 


GTTTTTTTTT 


TTTCTTTTAT 


TGGGAGTGTT 


900 


TTACGCTTTA 


AGAGTTTAAT 


AAAGAACCAC 


960 



33 
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CGTTAATTTA AAAAAAAAAA AAAAAAAA " " 988 

<210> 33 
<211> 1874 
<212> DNA 

<213> Chlamydomonas reinhardtii 
<400> 33 



GGCACGAGCT 


CGAGTTTGTT 


TTACCATGAC 


ATCGGGAATT 


TGGAAGCTTG 


AACTAPPTPA 


VJ Vj 


ATTACTCAAG 


TAACTCGCGG 


CAACACATTT 


CGCGCGCCAT 


CGCTGTTTTC 


TCTGCTCPAG 


120 


CTACCGAGCA 


GCATTGCTTT 


AGATCGCTTT 


GATGTCATAA 


ACTCCCACTT 


ATATGAGATC 


180 


CAGTTTCATC 


GAGCCCAAGC 


CCAGAGCGCA 


ACCTGTCTTA 


AGCCGCGGCA 


GGGCGTCCAT 


£. H \J 


GCGCCTCGCG 


CAAAGCCGTG 


CTCTCGTTGC 


GCGTGTCAGC 


TCCGCCCTGT 


GGPPPPPAPP 




AGGACTTTCA 


CAGGCTCAAA 


GCGTTGCGGT 


GCGAATGGCG 


AGTTCGTCAA 


PPTPPPA APP 


"3 £ n 


CACGGGCCTG 


AGCCAGGATG 


ACTTCATGCA 


GCGGGACGAG 


TGCTTGGTGG 


TftftAPP APPA 




GGACCGGCTG 


CTAGGCACCG 


CCAACAAGTA 


CGACTGCCAC 


CGCTTCGAGG 


PPftPPA APr-n 


*3 o U 


CCAGCCCTGC 


GGCCGCCTGC 


ACCGCGCCTT 


CTCCGTGTTC 


CTGTTPAGPP 


PPPAPPPPPP 


O H U 


ACTGCTGCTG 


CAGCAGCGCG 


CAGCCAGCAA 


GGTGACGTTC 


CCGGGTGTPT 

^^y\JV3VJ X V_J 1 \J X 


pp a pp a a p 2\ r* 


o U U 


CTGCTGCTCG 


CACCCGCTGG 


CGGGCCAGGC 


GCCGGACGAG 


GTGGAPPTPP 




£ n 


AGCCTCGGGC 


CAGGTGCCGG 


GCATCAAGGC 


GGCGGCGGTG 


PGPAAPPTPP 


A PP A PP Zi fPT 


Ton 


GGGGATACCG 


CCGGAGCAGG 


TTCCCGCCTC 


CTCCTTCTCC 


TTCCTPAPGP 


PTPTPPAPTZi 


*7 q n 


CTGCGCCGCC 


GACACCGCCA 


CGCACGGCCC 


GGCGGCGGAG 


TGGGGCGAGP 


APPAPPTPPA 


O fl u 


CTACGTGCTG 


TTCGTGCGGC 


CGCAGCAGCC 


CGTCAGCCTG 


CAGCCCAACC 


PAPAPPAPPT 


QOO 


GGACGCCACG 


CGCTACGTGA 


CGCTGCCGGA 


GCTTCAGTCC 


ATGATGGCGG 


APPCPPPPPT 


Vj VJ 


CAGCTGGAGC 


CCCTGGTTCC 


GCATCCTGGC 


CACACAGCCC 


GCCTTCCTGC 


CCGCCTPPTP 


J. VJ £. \J 


GGGCGACCTG 


AAGCGGCGCT 


GGCGCCCGGG 


CGGCAGCCGA 


CTGTCGGACT 


GGGGCACCAT 


1080 


CCACCGCGTC 


ATGTGAAGAA 


AAAGGGGAAG 


CAGGGGCGGG 


AGCGGGGGAT 


GAATGGGAAT 


114 0 


GTGAATGCGA 


ttptpatppp 

x x \j x oni ovo 


bvu i boon x vj 


nbb itl bAAb 


AC ACjG (j G G AA 


AATCGGGGGG 


1200 


CGGGCGTGAG 


CGTGTGTGTA 


CGTGAGCGAC 


AAAGCCGGGA 


GGCGGACCGC 


GCGATGGGTA 


1260 


CATGTGTGTG 


CGGAGGGTCG 


GTGGGTCGGT 


CGGTTGCGCG 


GCATAGCGTG 


TTGTGTGTGT 


1320 


GCGGCTGCAG 


GGGTATGTGG 


GCACCCGGGC 


ACGGAGGAGA 


AGGCACACGC 


AGGTGGCGCG 


1380 


GAGGTGTGTC 


AGGGGCCATG 


GGCGGGCCTC 


ACTCCTGGTC 


GTGCCCAGTG 


GTCTCGTGGG 


1440 


CAGAGTGGCA 


GGGGCTGCAC 


CCATATGAGC 


GGCGCACTGC 


CGCGCTGGGC 


TAAGTCCTTA 


1500 


TCACTTGGTG 


AGGTGGGGCG 


AGGTGGCTGT 


GGGCGGCGGG 


CGCAGTGGCA 


GAAGGACACG 


1560 


GTGTGTGAGC 


GGTGGAGCTC 


TGGCCGTGCC 


GGCCGTGAGG 


GGCGGATAGC 


GATATGACGT 


1620 



SS 
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TGTGCTTPPP 


ran t zx n t p 






GAGAAGCGGG 


CGGTGGCAGG 


1680 




f" >r P/' > /~ ,, Tvf"'r■" , 7\P , P , 


Ob 1 1 GGGGAG 


GTGCCACCTG 


CAGGCGCGGC 


GCCGGGCGGG 


174-0 


CCTGAGTAAT 


GGGCGCCTGA 


GTAGTGGCGG 


CCACAGGAGG 


CGCAGGAGGC 


AGCAGCAGGA 


1800 


GGACGAGCTG 


GAGGGACCCG 


TTGGCAACCC 


AAGGTTGCGC 


GTGTAACATA 


GTGGCCATAC 


1860 


AAAAAAAAAA 


AAAA 










1874 


<210> 34 
<211> 954 
<212> DNA 

<213> Tagetes erecta 












<400> 34 
CCAAAAACAA 


CTCAAATCTC 


CTCCGTCGCT 


CTTACTPPPP 


CTXTCCZCZTCZim 
\->r\. 1 \J\J\J 1 bnb 


oAL 1 LbbbbA 


60 


TGGATGCTGT 


TCAGCGACGT 


CTCATGTTTG 


acgatgaatp 




bA 1 brtb J. \j J. o 


120 


ACAATGTGGT 


GGGACATGAT 


ACCAAATACA 


ATTGTCAPTT 


patppapzx tap^ 

on l ounonnb 


7A TTP TV 7\ T\ P» 7\ C 

A I 1 bAAALAb 


loO 


GTAAAATGCT 


GCACAGAGCA 


TTCAGCGTTT 


TTPTATTPA A 


TTPAAA ZiTZi/^ 




^40 


TTCAGCAACG 


GTCTGCAACC 


AAGGTGAPAT 


l iv-v-l 1 J. no l 


A 1 bbAbbAAL. 


Abb I G I TGCA 


300 


GCCATCCACT 


CTACAGAGAA 


TPPGAPPTTP 






T> TV TV /*~* ta 7V TV m 

GTAAGAAATG 


360 


CTGCACAGAG 


GAAGCTGTTG 






1 oAAbA IbiT 


CCCGTTGATC 


4 20 


AGTTTACTCC 


TTTAGGTCGC 




ZiPPr^T/T'Zi tp* 


1 oA 1 bb AAAG 


TGGGGAGAAC 


480 


ATGAACTTGA 


CTACCTACTT 


TTfATA^THfi 

1 X X 1 O^l 


r; a p a r*p t t rr 


r PP* r P 7\ -j\ ~r\ /~- r~* 


TV T\ /^/~ tv /"» Tv rr»i^« 

AACCCAGATG 


54 0 


AAGTGGCGGA 


TATCAAATAT 


GTGAPPAPAA 


nAHTTAAAnr: 


7\PZP , T , P"P' , T , 7\ A P* 
nbb 1 bb 1 riHb 


bAAAbbACjAT 


C f\ f\ 

600 


GCGGGGGAGG 


AGGGTTTGAA 


GCTGTPTPPA 


TPPTTPAPPT 


1 rib I Vjjo 1 I bft 


rp Tv TV /"•rn rp/^»rp rp 

1 AAb i Ibl lb 


660 


TTCAAGTGGT 


GGGATCATGT 


GCAAAAGGGT 


APAPTPAPTP 


ZA Zi r*P Zi Zi T T 1 P* 7A 


l A I bAAAACC 


720 


ATACACAAGC 


TGATATAGAA 


a p Zi c zi rrr t 


Zi Zi CCC* 7A 7A 7A 7\ P* 
/^nbboAAAAb 


1 IbAAGCCTA 


ATAATTCGGG 


780 


TTGGGTCGGG 


TCTACCATCA 


ATTGTTTTTT 


TCTTTTAAGA 


AGTTTTAATC 


TCTATTTGAG 


840 


CATGTTGATT 


CTTGTCTTTT 


GTGTGTAAGA 


TTTTGGGTTT 


CGTTTCAGTT 


GTAATAATGA 


900 


ACCATTGATG 


GTTTGCAATT 


TCAAGTTCCT 


ATCGACATGT 


AGTGATCTAA 


AAAA 


954 


<210> 35 
<211> 1031 
<212> DNA 

<213> Oryza sativa 












<400> 35 
CCTCCCTTTG 


CCTCGCGCAG 


AGGCGGCCGC 


GCCTTCTCCG 


CCGCGAGGAT 


GGCCGGCGCC 


60 


GCCGCCGCCG 


TGGAGGACGC 


CGGGATGGAC 


GAGGTCCAGA 


AGCGGCTCAT 


GTTCGACGAC 


120 


GAATGCATTT 


TGGTGGATGA 


ACAAGACAAT 


GTTGTTGGCC 


ATGAATCAAA 


ATATAACTGC 


180 


CATCTGATGG 


AAAAAATCGA 


ATCTGAAAAT 


CTACTTCATA 


GGGCTTTCAG 


TGTATTCCTG 


240 
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TTCAACTCAA 


AATATGAACT 


CCTACTCCAG 


CAACGATCTG 


CAACAAAGGT 


TACATTf CCT 


300 


CTAGTTTGGA 


CCAACACTTG 


CTGCAGCCAT 


CCTCTGTACC 


GTGAGTCTGA 


GCTTATACAG 


3 60 


GAAAACTACC 


TTGGTGTTAG 


AAATGCTGCT 


CAGAGGAAGC 


TCTTGGATGA 


GCTGGGCATC 




CCAGCTGAAG 


ATGTGCCAGT 


TGACCAATTC 


ACCCCTCTTG 


GTCGGATGCT 


TTACAAGGCC 


4 8 0 


CCATCTGATG 


GAAAATGGGG 


TGAACACGAG 


CTTGACTACC 


TGCTGTTCAT 


CGTCCGCGAC 


54 0 


GTGAAGGTAG 


TCCCGAACCC 


GGACGAAGTG 


GCCGATGTGA 


AATACGTGAG 


CCGTGAGCAG 


600 


CTGAAGGAGC 


TCATCCGCAA 


AGCGGACGCC 


GGAGAGGAAG 


G.CCTGAAGCT 


GTCTCCCTGG 


660 


TTCCGGCTGG 


TTGTTGACAA 


CTTCCTCATG 


GGCTGGTGGG 


ATCACGTCGA 


GAAAGGCACC 


720 


CTCAACGAGG 


CCGTGGACAT 


GGAGACCATC 


CACAAGCTGA 


AGTAAGGACT 


GCGATGTTGT 


780 


GGCTGGAAAG 


AATGATCCTG 


AAGACTCTGT 


TCTTGTGCTG 


CTGCATATTA 


CTCTTACCAG 


840 






QjAAVjL 1111b 


rri t\ rp f~* rrt rp rp /*> rp /"* 

lAJbl 1 ILlb 


GGTTTGGAGC 


TTGGAAGTGT 


900 


TGGGCTCTGC 


TGACTGAGAG 


ATTCCCTTAT 


AGAGTGTCTA 


TGTTAATTTA 


GCAAACTTCT 


960 


ATATTATACA 


TGATTAGTTA 


ATTGTTCGGT 


GTCTGAATAA 


AGAACAATAG 


CATGTTCCAT 


1020 


GTTTATTTGC 


T 










1031 



<210> 36 
<211> 232 
<212> PRT 

<213> Tagetes erecta 



<400> 36 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gin Arg Arg Leu Met Phe 
15 10 15 

Asp Asp Glu Cys lie Leu Val Asp Glu Cys Asp Asn Val Val Gly His 
20 25 30 

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys lie Glu Thr Gly Lys 
35 4 0 4 5 

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu 
50 55 60 

Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val 
65 70 75 80 

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu 
85 90 95 

Val Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu 
100 105 110 

Leu Asp Glu Leu Gly lie Pro Ala Glu Asp Val Pro Val Asp Gin Phe 
115 120 125 

Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp 
130 135 140 
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Gly Glu His Glu Leu Asp Tyr Leu Leu Phe lie Val Arg Asp Val Ala 
145 150 155 160 . 

Val Asn Pro Asn Pro. Asp Glu Val Ala Asp He Lys Tyr Val Ser His 
165 170 J ' 175 

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Glv 
180 185 190 

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe 
195 200 205 

Lys Trp Trp Asp His Val Gin Lys Gly Thr Leu Thr Glu Ala He Asd 
210 215 220 

Met Lys Thr He His Lys Leu He 
225 230 

<210> 37 
<211> 280 
<212> PRT 

<213> Lactuca Sativa 
<400> 37 

Met Leu Lys Phe Pro Pro Phe Lys Thr He Ala Thr Met He Ser Ser 
15 io 15 

Pro Tyr Ser Ser Phe Leu Leu Pro Arg Lys Ser Ser Phe Pro Pro Met 
20 25 30 

Pro Ser Leu Ala Ala Ala Ser Val Phe Leu His Pro Leu Ser Ser Ala 
35 40 45 

Ala Met Gly Asp Ser Ser Met Asp Ala Val Gin Arg Arg Leu Met Phe 
50 55 60 

Asp Asp Glu Cys He Leu Val Asp Glu Asn Asp Lys Val Val Glv His 
65 ™ 75 ' 80 

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Lys Glv Asn 

85 90 95 

Met Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu 
100 105 no 

Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val 
115 120 125 

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu 
130 135 140 

lie Asp Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu 
145 150 155 i 6 o 

Leu Asp Glu Leu Gly He Pro Gly Ala Asp Val Pro Val Asp Glu Phe 
165 170 175 

Thr Pro Leu Gly Arg He Leu Tyr Lys Ala Ala Ser Asp Gly Lvs TrD 
180 185 190 

Gly Glu His Glu Leu Asp Tyr Leu Leu Phe Met Val Arg Asp Val Gly 



2.-7 
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195 200 205 

Leu Asp Pro Asn Pro Asp Glu Val Lys Asp Val Lys Tyr Val Asn Arg 
.210 215 220 

Glu Glu Leu Lys Glu Leu Val Arg Lys Ala Asp Ala Gly Glu Glu Gly 
225 230 235 240 

Val Lys Leu Ser Pro Trp Phe Lys Leu He Val Asp Asn Phe Leu Phe 
245 250 255 

Gin Trp Trp Asp Arg Leu His Lys Gly Thr Leu Thr Glu Ala He Asp 
260 265 270 

Met Lys Thr He His Lys Leu Thr 
275 280 

<210> 38 
<211> 229 
<212> PRT 

<213> Lactuca Sativa 
<400> 38 

Met Gly Asp Asp Ser Gly Met Asp Ala Val Gin Arg Arg Leu Met Phe 
1 .5 10 15 

Asp Asp Glu Cys He Leu Val Asp Glu Asn Asp Asn Val Leu Gly His 
20 25 30 

Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Lys Asp Asn 
35 40 ■ 45 

Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu 
50 55 60 

Leu Leu Leu Gin Gin Arg Ser Glu Thr Lys Val Thr Phe Pro Leu Val 
65 70 75 80 

Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser Glu Leu 
85 90 95 

He Pro Glu Asn Ala Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu 
100 105 110 

Leu Asp Glu Leu Gly He Pro Ala Glu Asp Val Pro Val Asp Glu Phe 
115 120 125 

Thr Thr Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp 
130 135 140 

Gly Glu His Glu Val Asp Tyr Leu Leu Phe Leu Val Arg Asp Val Ala 
145 150 155 ' 160 

Val Asn Pro Asn Pro Asp Glu Val Ala Asp He Arg Tyr Val Asn Gin 
165 170 - 175 

Glu Glu Leu Lys Glu Leu Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly 
180 185 ^ 190 

Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe 
195 200 205 



3r 
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Lys Trp Trp Asp His Val Gin Lys Gly Thr Leu Asn Glu Ala lie Asp 
210 215 220 

Met Lys Thr lie His . 
225 

<210> 39 
<211> 295 
<212> PRT 

<213> Adonis Palaestina 
<400> 39 

Met Ser Ser lie Arg He Asn Pro Leu Tyr Ser He Phe Ser Thr Thr 
1 5 10 15 

Thr Lys Thr Leu Ser Ala Ser Cys Ser Ser Pro Ala Val His Leu Gin 
20 25 30 

Gin Arg Cys Arg Thr Leu Ser He Ser Ser Ser He Thr Asn Ser Pro 
35 40 45 

Arg Arg Gly Leu Asn Arg Leu Phe Ala Ser Thr Ser Thr Met Gly Glu 
50 55 60 

Val Ala Asp Ala Gly Met Asp Ala Val Gin Lys Arg Leu Met Phe Asp 
65 70 75 80 

Asp Glu Cys lie Leu Val Asp Glu Asn Asp Lys Val Val Gly Tyr Asp 
8 5 90 95 

Ser Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Ala Glu Asn Leu 
100 105 HO 

Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys Tyr Glu Leu 
115 120 125 

Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro Leu Val Trp 
130 135 140 

Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser Glu Leu He 
!45 150 155 160 

Glu Glu Asn Phe Leu Gly Val Arg Asn Ala Ala Gin Arg Lys Leu Leu 
165 170 175 

Asp Glu Leu Gly He Pro Ala Glu Asp Val Pro Val Asp Glu Phe Thr 
180 185 190 

Pro Leu Gly Arg He Leu Tyr Lys Ala Pro Ser Asp Gly Lys Trp Gly 
195 200 205 

Glu His Glu Leu Asp Tyr Leu Leu Phe He Val Arg Asp Val Lys Tyr 
210 215 220 

Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val Asn Arg Glu 
225 230 235 240 

Glu Leu Lys Glu He Leu Arg Lys Ala Asp Ala Gly Glu Glu Gly He 
245 250 " 255 

Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe Leu Phe Lys 
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260 



265 



270 



Trp Trp Asp His Val Glu Glu Gly Lys lie Lys Asp Val Ala Asp Met 
275 280 ■■ 285 

Lys Thr lie His Lys Leu Thr 



<210> 40 
<211> 234 
<212> PRT 

<213> Adonis Palaestina 
<400> 40 

Met Gly Glu Val Thr Asp Ala Gly Met Asp Ala Val Gin Lys Arg Leu 
1 5 10 J 15 

Met Phe Asp Asp Glu Cys lie Leu Val Asp Glu Asn Asp Lys Val Val 

20 25 ' 30 

Gly His Asp Ser Lys Tyr Asn Cys His Leu Met Glu Lys lie Glu Ala 
35 40 45 

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys 
50 55 60 

Tyr Glu Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys Val Thr Phe Pro 
65 70 75 80 

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Phe Arg Asp Ser 
85 90 95 

Glu Leu lie Glu Glu Asn Tyr Leu Gly Val Arg Asn Ala Ala Gin Arg 
100 105 110 

Lys Leu Leu Asp Glu Leu Gly lie Pro Ala Glu Asp Val Pro Val Asp 
115 120 125 

Glu Phe Thr Pro Leu Gly Arg lie Leu Tyr Lys Ala Pro Ser Asp Gly 
130 135 140 

Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe lie Val Arg Asp 
145 150 155 160 

Val Lys Tyr Asp Pro Asn Pro Asp Glu Val Ala Asp Ala Lys Tyr Val 
165 170 175 

Asn Arg Glu Glu Leu Arg Glu lie Leu Arg Lys Ala Asp Ala Gly Glu 
180 185 190 

Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe 
195 200 205 

Leu Phe Lys Trp Trp Asp His Val Glu Gin Gly Thr lie Lys Glu Val 
210 215 220 

Ala Asp Met Lys Thr He His Lys Leu Thr 
225 230 

<210> 41 
<211> 238 



290 



295 
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<212> PRT 

<213> Oryza Sativa 
<400> 41 

Met Ala Gly Ala Ala Ala Ala Val Glu Asp Ala Gly Met Asp Glu Val 
1 5 10 15 

Gin Lys Arg Leu Met Phe Asp Asp Glu Cys He Leu Val Asp Glu Gin 
20 25 30 

Asp Asn Val Val Gly His Glu Ser Lys Tyr Asn Cys His Leu Met -Glu 
35 40 4 5 

Lys He Glu Ser Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu 
50 55 60 

Phe Asn Ser Lys Tyr Glu Leu Leu Leu Gin Gin Arg Ser Ala Thr Lys 
65 70 75 80 

Val Thr Phe Pro Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu 
8 5 90 95 

Tyr Arg Glu Ser Glu Leu He Gin Glu Asn Tyr Leu Gly Val Arg Asn 
100 105 ' HO 

Ala Ala Gin Arg Lys Leu Leu Asp Glu Leu Gly He Pro Ala Glu Asp 
115 120 125 

Val Pro Val Asp Gin Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala 
130 135 - 140 

Pro Ser Asp Gly Lys Trp Gly Glu His Glu Leu Asp Tyr Leu Leu Phe 
i45 150 155 * 160 

He Val Arg Asp Val Lys Val Val Pro Asn Pro Asp Glu Val Ala Asp 
165 170 175 

Val Lys Tyr Val Ser Arg Glu Gin Leu Lys Glu Leu He Arg Lys Ala 
180 . 185 190 

Asp Ala Gly Glu Glu Gly Leu Lys Leu Ser Pro Trp Phe Arg Leu Val 
195 200 205 

Val Asp Asn Phe Leu Met Gly Trp Trp Asp His Val Glu Lys Gly Thr 
210 215 * 220 

Leu Asn Glu Ala Val Asp Met Glu Thr lie His Lys Leu Lys 
225 230 235 

<210> 42 
<211> 233 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 42 

Met Thr Asp Ser Asn Asp Ala Gly Met Asp Ala Val Gin Arg Arg Leu 
1 5 10 15 

Met Phe Glu Asp Glu Cys He Leu Val Asp Glu Asn Asn Arg Val Val 
20 25 30 
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Gly His Asp Thr Lys Tyr Asn Cys His Leu Met Glu Lys He Glu Ala 
35 40 45 

Glu Asn Leu Leu His Arg Ala Phe Ser Val Phe Leu Phe Asn Ser Lys 
50 '55 60 

Tyr Glu Leu Leu Leu Gin Gin Arg Ser Lys Thr Lys Val Thr Phe Pro 
65 ™ 75 ' so 

Leu Val Trp Thr Asn Thr Cys Cys Ser His Pro Leu Tyr Arg Glu Ser 
85 90 "* 95 

Glu Leu He Glu Glu Asn Val Leu Gly Val Arg Asn Ala Ala Gin Arq ' 
100 105 HO 

Lys Leu Phe Asp Glu Leu Gly He Val Ala Glu Asp Val Pro Val Asp 
115 120 125 

Glu Phe Thr Pro Leu Gly Arg Met Leu Tyr Lys Ala Pro Ser Asp Glv 
130 135 140 

Lys Trp Gly Glu His Glu Val Asp Tyr Leu Leu Phe He Val Arq Asd 
1^5 150 155 160 

Val Lys Leu Gin Pro Asn Pro Asp Glu Val Ala Glu He Lys Tyr Val 
165 170 175 

Ser Arg Glu Glu Leu Lys Glu Leu Val Lys Lys Ala Asp Ala Gly Asp 
180 185 190 

Glu Ala Val Lys Leu Ser Pro Trp Phe Arg Leu Val Val Asp Asn Phe 
195 200 205 

Leu Met Lys Trp Trp Asp His Val Glu Lys Gly Thr He Thr Glu Ala 
210 215 220 

Ala Asp Met Lys Thr He His Lys Leu 
225 230 

<210> 43 
<211> 293 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 43 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His He Pro Arg Val Asn 
1 5 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30 

Arg Ser Met Gin Leu Leu Ser Glu Asp Arg Thr Asp His Met Arg Gly 
35 40 45 

Ala Ser Thr Trp Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys 
50 55 60 

Asp Glu Cys He Leu Val Asp Val Glu Asp Asn He Thr Gly His Ala 
65 70 75 ~ 80 

Ser Lys Leu Glu Cys His Lys Phe Leu Pro His Gin Pro Ala Gly Leu 

HZ 
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85 90 95 

Leu His Arg Ala Phe Ser Val Phe Leu Phe ,Asp Asp Gin Gly Arg Leu 
100 105 110 

Leu Leu Gin Gin Arg Ala Arg Ser Lys He Thr Phe Pro Ser Val Trp 
115 120 125 

Thr Asn Thr Cys Cys Ser His Pro Leu His Gly Gin Thr Pro Asp Glu 
130 ^ 135 140 

Val Asp Gin Leu Ser Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys 
145 150 155 ' 160 

Ala Ala Ala He Arg Lys Leu Glu His Glu Leu Gly He Pro Ala His 
165 170 175 

Gin Leu Pro Ala Ser Ala Phe Arg Phe .Leu Thr Arg Leu His Tyr Cys 
180 185 190 

Ala Ala Asp Val Gin Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu 
195 200 205 

His Glu Met Asp Tyr He Leu Phe He Arg Ala Asn Val Thr Leu Ala 
210 " 215 220 

Pro Asn Pro Asp Glu Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu 
225 * 230 235 240 

Leu Arg Gin Met Met Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp 
245 250 255 

Phe Arg He He Ala Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu 
260 265 270 

Asp Ala Ala Leu Asn Thr Asp Lys His Glu Asp Trp Gly Thr Val His 
275 280 285 

His lie Asn Glu Ala 
290 

<210> 44 
<211> 304 
<212> PRT 

<213> Haematococcus pluvialis 
<400> 44 

Met Leu Arg Ser Leu Leu Arg Gly Leu Thr His He Pro Arg Val Asn 
15 10 15 

Ser Ala Gin Gin Pro Ser Cys Ala His Ala Arg Leu Gin Phe Lys Leu 
20 25 30 

Arg Ser Met Gin Met Thr Leu Met Gin Pro Ser He Ser Ala Asn Leu 
35 40 45 

Ser Arg Ala Glu Asp Arg Thr Asp His Met Arg Gly Ala Ser Thr Trp 
50 55 60 

Ala Gly Gly Gin Ser Gin Asp Glu Leu Met Leu Lys Asp Glu Cys He 
65 70 75 80 



M3 
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Leu Val Asp Val Glu Asp Asn lie Thr Gly His Ala Ser Lys Leu Glu 
8 5 90 95 

Cys' His Lys Phe Leu Pro His Pro Ala Gly Leu Leu His Arg. Ala Phe 
100 105 110 

Ser Val Phe Leu Phe Asp Asp Gin Gly Arg Leu Leu Leu Gin Gin Arg 
115 120 125 

Ala Arg Ser Lys lie Thr Phe Pro Ser Val Trp Thr Asn Thr Cys Cys 
130 135 14 0 

Ser His Pro Leu His Gly Gin Thr Pro Asp Glu Val Asp Gin Leu Ser ' 
145 150 155 160 

Gin Val Ala Asp Gly Thr Val Pro Gly Ala Lys Ala Ala Ala lie Arg 
165 170 * 175 

Lys Leu Glu His Glu Leu Gly lie Pro Ala His Gin Leu Pro Ala Ser 
180 185 190 

Ala Phe Arg Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Val Gin 
195 200 " 205 

Pro Ala Ala Thr Gin Ser Ala Leu Trp Gly Glu His Glu Met Asp Tyr 
210 215 220 

lie Leu Phe lie Arg Ala Asn Val Thr Leu Ala Pro Asn Pro Asp Glu 
225 230 235 240 

Val Asp Glu Val Arg Tyr Val Thr Gin Glu Glu Leu Arg Gin Met Met 
245 250 255 

Gin Pro Asp Asn Gly Leu Gin Trp Ser Pro Trp Phe Arg lie lie Ala 
260 265 270 

Ala Arg Phe Leu Glu Arg Trp Trp Ala Asp Leu Asp Ala Ala Leu Asn 
275 280 285 

Thr Asp Lys His Glu Asp Trp Gly Thr Val His His lie Asn Glu Ala 
290 295 300 

<210> 45 
<211> 307 
<212> PRT 

<213> Chlamydomonas reinhardtii 
<400> 45 

Met Arg Ser Ser Phe He Glu Pro Lys Pro Arg Ala Gin Pro Val Leu 
1 5 10 15 

Ser Arg Gly Arg Ala Ser Met Arg Leu Ala Gin Ser Arg Ala Leu Val 
20 25 30 

Ala Arg Val Ser Ser Ala Leu Trp Pro Gly Ala Gly Leu Ser Gin Ala 
35 40 45 

Gin Ser Val Ala Val Arg Met Ala Ser Ser Ser Thr Trp Glu Gly Thr 
50 55 60 

Gly Leu Ser Gin Asp Asp Phe Met Gin Arg Asp Glu Cys Leu Val Val 

mm 
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65 



70 



75 



80 



Asp Glu Gin Asp Arg Leu Leu Gly Thr Ala Asn Lys Tyr Asp Cys His 
85 90 95 

Arg Phe Glu Ala Ala Lys Gly Gin Pro Cys Gly Arg Leu His Arg Ala 
100 105 110 

Phe Ser Val Phe Leu Phe Ser Pro Asp Gly Arg Leu Leu Leu Gin Gin 
115 120 125 

Arg Ala Ala Ser Lys Val Thr Phe Pro Gly Val Trp Thr Asn Thr Cys 
130 135 140 

Cys Ser His Pro Leu Ala Gly Gin Ala Pro Asp Glu Val Asp Leu Pro 
145 150 155 160 

Ala Ala Val Ala Ser Gly Gin Val Pro Gly lie Lys Ala Ala Ala Val 
165 ~ 170 175 

Arg Lys Leu Gin His Glu Leu Gly lie Pro Pro Glu Gin Val Pro Ala 
180 185 190 

Ser Ser Phe Ser Phe Leu Thr Arg Leu His Tyr Cys Ala Ala Asp Thr 
195 200 205 

Ala Thr His Gly Pro Ala Ala Glu Trp Gly Glu His Glu Val Asp Tyr 
210 215 220 

Val Leu Phe Val Arg Pro Gin Gin Pro Val Ser Leu Gin Pro Asn Pro 
225 230 235 240 

Asp Glu Val Asp Ala Thr Arg Tyr Val Thr Leu Pro Glu Leu Gin Ser 
245 250 255 

Met Met Ala Asp Pro Gly Leu Ser Trp Ser Pro Trp Phe Arg He Leu 
260 " 265 270 

Ala Thr Gin Pro Ala Phe Leu Pro Ala Trp Trp Gly Asp Leu Lys Arg 
275 280 285 

Arg Trp Arg Pro Gly Gly Ser Arg Leu Ser Asp Trp Gly Thr He His 
290 ' 295 300 



Arg Val Met 
305 

<210> 46 
<211> 1848 
<212> DNA 

<213> Adonis palaestina 
<400> 46 

GAGAGAAAAA GAGTGTTATA TTAATGTTAC TGTCGCATTC TTGCAACACA TAT TC AG ACT 60 
CCATTTTCTT GTTTTCTCTT CAAAACAACA AACTAATGTG ACGGAGTATC TAGCTATGGA 120 
ACTACTTGGT GTTCGCAACC TCATCTCTTC TTGCCCTGTC TGGACTTTTG GAACAAGAAA 180 
CCTTAGTAGT TCAAAACTAG CTTATAACAT ACATCGATAT GGTTCTTCTT G TAG AG TAG A 24 0 
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TTTTCAAGTG AGGGCTGATG GTGGAAGCGG GAGTAGAACT TCTGTTGCTT ATAAAGAGGG 300 

TTTTGTGGAC GAGGAGGATT TTATCAAAGC TGGTGGTTCT GAGCTTTTGT TTGTCCAAAT 360 

GCAGCAAACA AAGTCTATGG AGAAACAGGC CAAGCTCGCC GATAAGTTGC CACCAATACC 4 20 

TTTCGGAGAA TCTGTGATGG ACTTGGTTGT AATAGGTTGT GGACCTGCTG GTCTTTCACT 4 80 

GGCTGCAGAA GCTGCTAAGC TAGGCTTGAA AGTTGGCCTT ATTGGTCCTG ATCTTCCTTT. 54 0 

TACAAATAAT TATGGTGTGT GGGAAGACGA GTTCAAAGAT CTTGGACTTG AACGTTGTAT 600. 

CGAGCATGCT TGGAAGGACA CCATCGTATA TCTTGACAAT GATGCTCCTG TCCTTATTGG 660 

TCGTGCATAT GGACGAGTTA GCCGGCATTT GCTGCATGAA GAGTTGCTGA AAAGGTGTGT 7 20 

CGAGTCAGGT GTATCATATC TGAATTCTAA AGTGGAAAGG ATCACTGAAG CTGGTGATGG 7 80 

CCATAGTCTT GTAGTTTGTG AAAACGACAT CTTTATCCCT TGCAGGCTTG CTACTGTTGC 84 0 

ATCTGGAGCA GCTTCAGGGA AACTTTTGGA GTATGAAGTA GGTGGCCCTC GTGTTTGTGT 900 

CCAAACTGCT TATGGTGTGG AGGTTGAGGT GGAGAACAAT CCATACGATC CCAACTTAAT 960 

GGTATTTATG GACTACAGAG ACTATATGCA ACAGAAATTA CAGTGCTCGG AAGAAGAATA 1020 

TCCAACATTT CTCTATGTCA TGCCCATGTC GCCAACAAGA CTTTTTTTTG AGGAAACCTG 1080 

TTTGGCCTCA AAAGATGCCA TGCCTTTCGA TCTACTGAAG AG AAAAC T AA TGTCACGATT 1140 

GAAGACTCTG GGTATCCAAG T T AC AAAAAT TTATGAAGAG GAATGGTCTT ATATTCCTGT 1200 

TGGGGGTTCT TTACCAAACA CAGAGCAAAA GAACCTAGCA TTTGGTGCTG CAGCAAGCAT 1260 

GGTGCATCCA GCAACAGGCT ATTCGGTTGT AC GAT C ACT A TCAGAAGCTC CAAAATATGC 1320 

TTCTGTAATT GCAAAGATTT TGAAGCAAGA TAACTCTGCA TATGTGGTTT CTGGACAAAG 1380 

CAGTGCAGTA AACATTTCAA TGCAAGCATG GAGCAGTCTT TGGCCAAAGG AGCGAAAACG 14 4 0 

TCAAAGAGCA TTCTTTCTTT TCGGGTTAGA GCTTATTGTG CAGCTAGATA TTGAAGCAAC 1500 

CAGAACGTTC TTTAGAACCT TCTTCCGCTT GCCAACTTGG ATGTGGTGGG GTTTCCTTGG 1560 

GTCTTCACTA TCATCTTTCG ATCTTGTATT GTTTTCCATG TACATGTTTG TTTTGGCCCC 1620 

GAACAGCATG AGGATGTCAC TTGTGAGACA TTTGCTTTCA GATCCTTCTG GTGCAGTTAT 1680 

GGTTAAAGCT TACCTCGAAA GGTAATCTGT TTTATGAAAC TATAGTGTCT CATTAAATAA 17 4 0 

ATGAGGATCC TTCGTATATG TATATGATCA TCTCTATGTA TATCCTATAT TCTAATCTCA 1800 

TAAAGTAATC GAAAATTCAT TGATAGAAAA AAAAAAAAAA AAAAAAAA 1848 

<210> 47 
<211> 529 
<212> PRT 
. <213> Adonis palaestina 

<400> 47 

Met Glu Leu Leu Gly Val Arg Asn Leu lie Ser Ser Cys Pro Val Trp 

ML 
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1 



5 



10 



15 



Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser'Lys Leu Ala Tyr Asn lie 
20 ~ 25 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 
35 " 40 45 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe lie Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 
65 ^ 70 75 80 

Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
85 90 95 

Lys Leu Pro Pro lie Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
100 105 110 

lie Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu lie Gly Pro Asp Leu Pro Phe Thr Asn 
130 ^ 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 160 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asp Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 ' 215 220 

Leu Val Val Cys Glu Asn Glu He Phe He Pro Cys Arg Leu Ala Thr 
225 "* 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr <Hu Val Gly 
245 250 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 



325 



330 



335 
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Lys Leu Met Ser Arg Leu Lys Thr Leu Gly lie Gin Val Thr Lys Val 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr lie Pro Val Gly Gly Ser Leu Pro Asn 
355 360 " 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 390 395 400 

Tyr Ala Ser Val lie Ala Lys lie Leu Lys Gin Asp Asn Ser Ala Tyr 
405 410 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn lie Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu lie Val Gin Leu Asp lie Glu Ala Thr Arg Thr 
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 ~ 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
4 85 4 90 4 95 

Met Phe Val Leu Ala Pro Asn Sex Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu 
515 520 525 

Arg 

<210> 48 
<211> 378 
<212> PRT 
<213> Potato 

<400> 48 

Asp Glu Phe Lys Asp Leu Gly Leu Gin Ala Cys lie Glu His Val Trp 
1 5 10 15 

Arg Asp Thr lie Val Tyr Leu Asp Asp Asp Asp Pro lie Leu lie Gly 
20 25 30 

Arg Ala Tyr Gly Arg Val Ser Arg His Leu Leu His Glu Glu Leu Leu 
35 40 45 

Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn Ser Lys Val Asp 
50 "* 55 60 

Arg lie Val Glu Ala Thr Asn Gly His Ser Leu Val Glu Cys Glu Gly 
65 70 75 ~ 80 

Asp Val Val lie Pro Cys Arg Phe Val Thr Val Ala Ser Gly Ala Ala 
85 90 95 
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Ser Gly Lys Phe Leu Gin Tyr Glu Leu Gly Gly Pro Arg Val Ser Val 
100 , 105 110 

Gin Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn Asn Pro Phe Asp 
115 120 125 

Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr Val Arg His Asp 
130 135 140 

Ala Gin Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu Tyr Ala Met Pro 
145 150 155 160 

Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys Leu Ala Ser Lys 
165 170 175 

Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu Met Leu Arg Leu 
180 185 190 

Asn Thr Leu Gly Val Arg lie Lys Glu lie Tyr Glu Glu Glu Trp Ser 
195 200 205 

Tyr lie Pro Val Gly Gly Ser Leu Pro Asn Thr Glu Gin Lys Thr Leu 
210 " 215 220 

Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser 
225 230 235 240 

Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala Phe Val Leu Ala 
245 250 255 

Asn lie Leu Arg Gin Asn His Ser Lys Asn Met Leu Thr Ser Ser Ser 
260 265 270 

Thr Pro Ser lie Ser Thr Gin Ala Trp Asn Thr Leu Trp Pro Gin Glu 
275 280 285 

Arg Lys Arg Gin Arg Ser Phe Phe Leu Phe Gly Leu Ala Leu lie Leu 
290 295 300 

Gin Leu Asp lie Glu Gly lie Arg Ser Phe Phe Arg Ala Phe Phe Arg 
305 310 315 320 

Val Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Ser Leu Ser Xaa 
325 " 330 335 

Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe lie lie Ala Pro Asn 
340 345 350 

Asp Met Arg Arg Gly Leu lie Arg His Leu Leu Ser Asp Pro Thr Gly 
355 360 365 



Ala Thr Leu lie Arg Thr Tyr Leu Thr Phe 
370 375 



<210> 49 

<211> 524 

<212> PRT 

<213> Arabidopsis 

<400> 49 



thaliana 
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Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 



Ser Tyr Arg Asn lie Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly 
35 40 45 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 " 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu lie Leu Phe 
65 70 75 80 

Val Gin Met Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val 
85 90 95 

Asp Lys Leu Pro Pro lie Ser lie Gly Asp Gly Ala Leu Asp His Val 
100 105 110 

Val lie Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 " 120 125 

Lys Leu Gly Leu Lys Val Gly Leu lie Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin 
145 ^ " 150 155 160 

Lys Cys lie Glu His Val Trp Arg Glu Thr lie Val Tyr Leu Asp Asp 
165 17b 175 

Asp Lys Pro lie Thr He Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 205 

Tyr Leu Ser Ser Lys Val Asp Ser He Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val He Pro Cys Arg Leu Ala 
225 230 235 240 

Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu 
260 265 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
305 " 310 315 320 

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 



1 



5 



10 



15 



Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 
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325 330 335 

Thr Lys Leu Met Leu Arg Leu Asp Thr Leu Gly lie Arg lie Leu Lys 
340 345 350 

Thr Tyr Glu Glu Glu Trp Ser Tyr lie Pro Val Gly Gly Ser Leu Pro 
355 360 365 

Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 
370 375 380 

His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pre 
385 390 395 400 

Lys Tyr Ala Ser Val lie Ala Glu He Leu Arg Glu Glu Thr Thr Lys 
405 410 415 

Gin He Asn Ser Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro 
420 425 430 

Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 445 

He Val Gin Phe Asp Thr Glu Gly He Arg Ser Phe Phe Arg Thr Phe 
4 50 4 55 4 60 

Phe Arg Leu Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val He Ser 
485 490 495 

Pro Asn Asn Leu Arg Lys Gly Leu He Asn His Leu He Ser Asp Pro 
500 " 505 510 

Thr Gly Ala Thr Met He Lys Thr Tyr Leu Lys Val 
515 520 



<210> 50 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 50 

Met Glu Leu Leu Gly Val Arg Asn Leu He Ser Ser Cys Pro Val Trp 
1 5 10 15 

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn He 
20 25 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 
35 40 45 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe He Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 
65 70 75 80 

Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
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85 
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95 



Lys Leu Pro Pro lie Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 

100 105 110 

lie Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 

115 120 125 

Leu Gly Leu Lys Val Gly Leu lie Gly Pro Asp Leu Pro Phe Thr Asn 

130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arq 

145 150 155 . J 160 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He Gly Arg Ala Tyr, Gly Arg Val Ser Arg His Leu 

180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 

195 200 205 

Leu Asp Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 

210 215 220 

Leu Val Val Cys Glu Asn Glu He Phe He Pro Cys Arg Leu Ala Thr 

225 230 235 " 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 

260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 

275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 

290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 

305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys Val 

340 345 . 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 

355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 

370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 

385 390 395 400 

Tyr Ala Ser Val He Ala Lys He Leu Lys Gin Asp Asn Ser Ala Tyr 



405 



410 



415 
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Val Val Ser Gly Gin Ser Ser Ala Val Asn He Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro* Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 " 445 

Phe Gly Leu Glu Leu He Val Gin Leu Asp He Glu Ala Thr Arg Thr 
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu Glu 
515 520 525 

Arg 



<210> 51 
<211> 529 
<212> PRT 

<213> Adonis palaestina 
<400> 51 

Met Glu Leu Leu Gly Val Arg Asn Leu He Ser Ser Cys Pro Val Trp 
1 5 10 15 

Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn He 
20 25 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 
35 40 45 

Gly Gly Ser Gly Ser Arg Thr Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe He Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 
65 70 75 80 

Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 
85 90 95 

Lys Leu Pro Pro He Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
100 105 no 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu lie Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 160 
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Cys lie Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He, Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asn Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 

Leu Val Val Cys Glu Asn Asp He Phe He Pro Cys Arg Leu Ala Thr 
225 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
24 5 250 " 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 

Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
. 325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys He 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 390 395 400 

Tyr Ala Ser Val lie Ala Lys He Leu Lys Gin Asp Asn Ser Ala Tyr 
405 410 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn He Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu He Val Gin Leu Asp He Glu Ala Thr Arg Thr 
4 50 4 55 4 60 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 * 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
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485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Lys Ala Tyr Leu Glu 
515 520 525 

Arg 



<210> 52 

<211> 533 

<212> PRT 

<213> Lettuce 

<400> 52 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe 
15 10 15 

Thr Cys Pro Arg Phe Thr Asp Cys Asn He Arg His Lys Phe Ser Leu 
20 25 ' 30 

Leu Lys Gin Arg Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arq 
35 40 45 

Gin He Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gin 
50 55 60 

Gly He Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser 
65 70 75 80 

Glu Leu Phe Phe Val Gin Met Gin Arg Thr Lys Ser Met Glu Ser Gin 
85 90 95 

Ser Lys Leu Ser Glu Lys Leu Ala Gin He Pro He Gly Asn Cys He 
100 105 no 

Leu Asp Leu Val Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala 
115 120 125 

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu He -Gly Pro Asp 
130 135 140 

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gin Asp Glu Phe He Gly 
145 150 155 i 6 o 

Leu Gly Leu Glu Gly Cys He Glu His Ser Trp Lys Asp Thr Leu Val 
165 170 ^ 175 

Tyr Leu Asp Asp Ala Asp Pro He Arg He Gly Arg Ala Tyr Gly Arq 
180 185 190 

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu 
195 200 205 

Ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg He Thr Glu Ala 
210 215 220 

Pro Asn Gly Tyr Ser Leu He Glu Cys Glu Gly Asn He Thr He Pro 
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225 230 235 240 

Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu 
245 250 255 

Glu Tyr Glu Leu Gly Gly Pro Arg Val Cys Val' Gin Thr Ala Tyr Gly 
260 265 270 

He Glu Val Glu Val Glu Asn Asn Pro Tyr Asp Pro Asp Leu Met Val 
275 280 285 

Phe Met Asp Tyr Arg Asp Phe Ser Lys His Lys Pro Glu Ser Leu Glu 
290 295 300 

Ala Lys Tyr Pro Thr Phe Leu Tyr Val Met Ala Met Ser Pro Thr Lvs 
305 310 315 320 

He Phe Phe Glu Glu Thr Cys Leu Ala Ser Arg Glu Ala Met Pro Phe 
325 330 335 

Asn Leu Leu Lys Ser Lys Leu Met Ser Arg Leu Lys Ala Met Gly He 
34 0 34 5 350 

Arg He Thr Arg Thr Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Glv 
355 360 365 

Gly Ser Leu Pro Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala 
370 375 380 

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu 
385 390 395 40 0 

Ser Glu Ala Pro Asn Tyr Ala Ala Val He Ala Lys He Leu Arg Gin 
405 410 415 

Asp Gin Ser Lys Glu Met lie Ser Leu Gly Lys Tyr Thr Asn He Ser 
420 425 430 

Lys Gin Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gin Arq 
435 440 445 

Ala Phe Phe Leu Phe Gly Leu Ser His He Val Leu Met Asp Leu Glu 
450 455 460 

Gly Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp Met 
465 470 475 480 

Trp Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu He lie 
485 490 495 

Phe Ala Leu Tyr Met Phe Val lie Ala Pro His Ser Leu Arg Met Glu 
500 505 510 

Leu Val Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Met Val Lys 
515 520 525 

Ala Tyr Leu Thr lie 
530 

<210> 53 



SX 
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<211> 526 
<212> PRT 
<213> Tomato 

<400> 53 

Met Glu Cys Val Gly Val Gin Asn Val Gly Ala Met Ala Val Leu Thr 
15 10 15 

Arg Pro Arg Leu Asn Arg Trp Ser Gly Gly Glu Leu Cys Gin Glu Lys 
20 25 ^30 

Ser lie Phe Leu Ala Tyr Glu Gin Tyr Glu Ser Lys Cys Asn Ser Ser 
35 40 45 

Ser Gly Ser Asp Ser Cys Val Val Asp Lys Glu Asp Phe Ala Asp Glu 
50 55 60 

Glu Asp Tyr lie Lys Ala Gly Gly Ser Gin Leu Val Phe Val Gin Met 
65 70 75 80 

Gin Gin Lys Lys Asp Met Asp Gin Gin Ser Lys Leu Ser Asp Glu Leu 
85 90 95 

Arg Gin He Ser Ala Gly Gin Thr Val Leu Asp Leu Val Val He Gly 
100 105 HO 

Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala Lys Leu Gly 
115 120 125 

Leu Asn Val Gly Leu Val Gly Pro Asp Leu Pro Phe Thr Asn Asn Tyr 
130 135 140 

Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Gin Ala Cys He 
145 150 155 160 

Glu His Val Trp Arg Asp Thr He Val Tyr Leu Asp Asp Asp Glu Pro 
165 170 175 

He Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Phe Leu His 
180 185 190 

Glu Glu Leu Leu Lys Arg Cys Val Glu Ala Gly Val Leu Tyr Leu Asn 
195 200 " 205 

Ser Lys Val Asp Arg He Val Glu Ala Thr Asn Gly Gin Ser Leu Val 
210 215 220 

Glu Cys Glu Gly Asp Val Val He Pro Cys Arg Phe Val Thr Val Ala 
225 230 235 240 

Ser Gly Ala Ala Ser Gly Lys Phe Leu Gin Tyr Glu Leu Gly Ser Pro 
245 250 255 

Arg Val Ser Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val Asp Asn 
260 2 65 270 

Asn Pro Phe Asp Pro Ser Leu Met Val Phe Met Asp Tyr Arg Asp Tyr 
275 280 285 

Leu Arg His Asp Ala Gin Ser Leu Glu Ala Lys Tyr Pro Thr Phe Leu 
290 295 300 
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Tyr Ala Met Pro Met Ser Pro Thr Arg Val Phe Phe Glu Glu Thr Cys 
305 310 315 320 

Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Lys Lys Leu 
325 330 " 335 

Met Leu Arg Leu Asn Thr Leu Gly Val Arg lie Lys Glu lie Tyr Glu 
340 345 350 

Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn Thr Glu 
355 3.60 365 

Gin Lys Thr Leu Ala Phe Gly Ala Ala Ala Ser Met Val His Pro Ala 
370 375 380 

Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys Cys Ala 
385 390 395 400 

Ser Val Leu Ala Asn He Leu Arg Gin His Tyr Ser Lys Asn Met Leu 
405 410 " 415 

Thr Ser Ser Ser He Pro Ser He Ser Thr Gin Ala Trp Asn Thr Leu 
420 425 430 

Trp Pro Gin Glu Arg Lys Arg Gin Arg Ser Phe Phe Leu Phe Gly Leu 
435 440 445 

Ala Leu . He Leu Gin Leu Asp He Glu Gly He Arg Ser Phe Phe Arg 
450 455 460 

Ala Phe Phe Arg Val Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser 
465 470 475 480 

Ser Leu Ser Ser Ala Asp Leu Met Leu Phe Ala Phe Tyr Met Phe He 
485 490 495 

lie Ala Pro Asn Asp Met Arg Lys Gly Leu He Arg His Leu Leu Ser 
500 505 " 510 

Asp Pro Thr Gly Ala Thr Leu He Arg Thr Tyr Leu Thr Phe 
515 520 " " 525 

<210> 54 
<211> 516 
<212> PRT 

<213> Tagetes erecta 
<400> 54 

Met Ser Met Arg Ala Gly His Met Thr Ala Thr Met Ala Ala Phe Thr 
15 10 15 

Cys Pro Arg Phe Met Thr Ser lie Arg Tyr Thr Lys Gin lie Lys Cys 
20 25 30 

Asn Ala Ala Lys Ser Gin Leu Val Val Lys Gin Glu lie Glu Glu Glu 
35 40 45 

Glu Asp Tyr Val Lys Ala Gly Gly Ser Glu Leu Leu Phe Val Gin Met 
50 55 60 
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Gin Gin Asn Lys Ser Met Asp Ala Gin Ser Ser Leu Ser Gin Lys Leu 
65 70 75 80 

Pro Arg Val Pro lie Gly Gly Gly Gly Asp Ser Asn Cys He Leu Asp 
85 90 95 

Leu Val Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Gly Glu 
100 105 no 

Ser Ala Lys Leu Gly Leu Asn Val Ala Leu He Gly Pro Asp Leu Pro 
115 120 125 

Phe Thr Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe He Gly Leu Glv 
130 135 140 

Leu Glu Gly Cys He Glu His Val Trp Arg Asp Thr Val Val Tyr Leu 
145 1^0 155 160 

Asp Asp Asn Asp Pro lie Leu He Gly Arg Ala Tyr Gly Arg Val Ser 
165 170 - 175 

Arg Asp Leu Leu His Glu Glu Leu Leu Thr Arg Cys Met Glu Ser Gly 
180 185 190 

Val Ser Tyr Leu Ser Ser Lys Val Glu Arg He Thr Glu Ala Pro Asn 
195 200 205 

Gly Leu Ser Leu He Glu Cys Glu Gly Asn He Thr He Pro Cys Arq 
210 215 220 

Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr 
225 230 235 240 

Glu Leu Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly He Glu 
245 250 " 255 

Val Glu Val Glu Ser He Pro Tyr Asp Pro Ser Leu Met Val Phe Met 
260 265 270 

Asp Tyr Arg Asp Tyr Thr Lys His Lys Ser Gin Ser Leu Glu Ala Gin 
275 280 285 

Tyr Pro Thr Phe Leu Tyr Val Met Pro Met Ser Pro Thr Lys Val Phe 
290 295 300 

Phe Glu Glu Thr Cys Leu Ala Ser Lys Glu Ala Met Pro Phe Glu Leu 
305 310 . 315 320 

Leu Lys Thr Lys Leu Met Ser Arg Leu Lys Thr Met Gly He Arg lie 
325 • 330 " 335 

Thr Lys Thr Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser 
340 345 350 

Leu Pro Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser 
355 360 365 

Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu 
370 375 380 

Ala Pro Asn Tyr Ala Ala Val lie Ala Lys lie Leu Gly Lys Gly Asn 
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385 390 395 400 

Ser Lys Gin Met Leu Asp His Gly Arg Tyr.. Thr Thr Asn lie Ser Lys 
405 410 415 

Gin Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gin Arg Ala 
420 425 430 

Phe Phe Leu Phe Gly Leu Ala Leu He Val Gin Met Asp He Glu Gly 
4 35 440 44 5 

Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp 
450 455 460 

Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu He He Phe 
465 470 475 480 

Ala Phe Tyr Met Phe He He Ala Pro His Ser Leu Arg Met Gly Leu 
485 490 ~ 495 

Val Arg His Leu Leu Ser Asp Pro Thr Gly Gly Thr Met Leu Lys Ala 
500 505 " 510 

Tyr Leu Thr He 
515 

<210> 55 
<211> 501 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 55 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Asp Phe Phe He Pro 
1 5 10 15 

Gin Phe His Gly Phe Glu Arg Leu Cys Ser Asn Asn Pro Tyr His Ser 
20 25 30 

Arg Val Arg Leu Gly Val Lys Lys Arg Ala He Lys He Val Ser Ser 
35 40 45 

Val Val Ser Gly Ser Ala Ala Leu Leu Asp Leu Val Pro Glu Thr Lys 
50 55 60 

Lys Glu Asn Leu Asp Phe Glu Leu Pro Leu Tyr Asp Thr Ser Lys Ser 
65 70 75 80 

Gin Val Val Asp Leu Ala He Val Gly Gly Gly Pro Ala Gly Leu Ala 
85 90 95 

Val Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Cys Ser He Asp 
100 105 HO 

Pro Ser Pro Lys Leu He Trp Pro Asn Asn Tyr Gly Val Trp Val Asp 
115 120 " 125 

Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr Thr Trp Ser 
130 135 ^ 140 

Gly Ala Val Val Tyr Val Asp Glu Gly Val Lys Lys Asp Leu Ser Arg 
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145 150 155 '" 16p 

Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Leu Gin 
165 170 " I75 

Lys Cys lie Thr Asn Gly Val Lys Phe His Gin Ser Lys Val Thr Asn 
180 185 190 

Val Val His Glu Glu Ala Asn Ser Thr Val Val Cys Ser Asp Gly Val 
195 200 205 

Lys lie Gin Ala Ser Val Val Leu Asp Ala Thr Gly Phe Ser Arq Cvs 
210 215 220 

Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin Val Ala Tvr 
225 230 235 240 

Gly He Val Ala Glu Val Asp Gly His Pro Phe Asp Val Asp Lys Met 
245 250 ' 255 

Val Phe Met Asp Trp Arg Asp Lys His Leu Asp Ser Tyr Pro Glu Leu 
260 265 270 

Lys Glu Arg Asn Ser Lys He Pro Thr Phe Leu Tyr Ala Met Pro Phe 
275 280 285 

Ser Ser Asn Arg He Phe Leu Glu Glu Thr Ser Leu Val Ala Arq Pro 
290 295 300 

Gly Leu Arg Met Glu Asp He Gin Glu Arg Met Ala Ala Arg Leu Lys 
305 310 315 320 

His Leu Gly lie Asn Val Lys Arg He Glu Glu Asp Glu Arg Cvs Val 
325 330 335 

He Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val Glv 
340 345 350 

He Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val 
355 360 365 

Ala Arg Thr Leu Ala Ala Ala Pro He Val Ala Asn Ala He Val Arg 
370 375 380 

Tyr Leu Gly Ser Pro Ser Ser Asn Ser Leu Arg Gly Asp Gin Leu Ser 
385 3 90 395 400 

Ala Glu Val Trp Arg Asp Leu Trp Pro He Glu Arg Arg Arg Gin Arq 
405 410 " 4 i5 

Glu Phe Phe Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp Leu Asp 
420 425 430 

Ala Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Gin Pro His Tvr 
435 440 445 

Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu Val 
450 455 460 

Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu 
465 470 475 4 80 
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lie Met Thr LysGly Thr Val Pro Leu Ala Lys Met lie Asn Asn Leu 
485 490 495 

Val Gin Asp Arg Asp 
500 

<210> 56 
<211> 502 
<212> PRT 

<213> Adonis palaestina 
<400> 56 

Met Asp Thr Leu Leu Arg Thr His Asn Lys Leu Glu Leu Leu Pro Thr 
15 10 15 

Leu His Gly Phe Ala Glu Lys Gin His Leu Val Ser Thr Ser Lys Leu 
20 25. 30 

Gin Asn Gin Val Phe Arg lie Ala Ser Arg Asn lie His Pro Cys Arg 
35 4 0 4 5 

Asn Gly Thr Val Lys Ala Arg Gly Ser Ala Leu Leu Glu Leu Val Pro 
50 55 60 

Glu Thr Lys Lys Glu Asn Leu Glu Phe Asp Leu Pro Ala Tyr Asp Pro 
65 70 75 80 

Ser Arg Gly lie Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala 
85 90 * 95 

Gly Leu Ala lie Ala Gin Gin Val Ser Glu Ala Gly Leu Leu Val Cys 
100 105 110 

Ser lie Asp Pro Ser Pro Lys Leu lie Trp Pro Asn Asn Tyr Gly Val 
115 120 125 

Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Thr. 
130 135 140 

Thr Trp Ser Gly Ala Val Val Tyr Thr Asp Asp Asn Ser Lys Lys Tyr 
145 150 155 160 

Leu Asp Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys 
165 170 175 

Met Leu Gin Lys Cys Val Thr Asn Gly Val Lys Phe His Gin Ala Lys 
180 185 190 

Val lie Lys Val lie His Glu Glu Ser Lys Ser Leu Leu lie Cys Asn 
195 200 205 

Asp Gly lie Thr lie Asn Ala Thr Val Val Leu Asp Ala Thr Gly Phe 
210 215 220 

Ser Arg Cys Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin 
225 230 235 240 

Val Ala Tyr Gly lie Met Ala Glu Val Glu Glu His Pro Phe Asp Leu 
245 250 255 
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Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser His Leu Asn Glu Lys 
260 265 270 ■ 

Leu Glu Leu Lys Asp. Lys Asn Arg Lys lie Pro Thr Phe Leu Tyr Ala 
275 280 285 

Met Pro Phe Ser Ser Thr Lys He Phe Leu Glu Glu Thr Ser Leu Val 

295 300 

Ala Arg Pro Gly Leu Arg Phe Glu Asp He Gin Glu Arg Met Val Ala 
J ° 5 310 315 320 

Arg Leu Lys His Leu Gly He Lys Val Lys Ser He Glu Glu Asp Glu 
325 330 335 

Arg Cys Val He Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Ara 
340 345 350 * 

Val Val Gly He Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Glv 
355 360 365 y 

Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Lys Ser ' 
370 375 380 

lie Val Gin Tyr Leu Gly Ser Asp Arg Ser Leu Ser Gly Asn Glu Leu 
ooz> 390 ' 



395 ~ 400 



Ser Ala Glu Val Trp Lys Asp Leu Trp Pro He Glu Arg Arg Arg Gin 
405 410 41 5 

Arg Glu Phe Phe Cys Phe Gly Met Asp lie Leu Leu Lys Leu Asp Leu 
420 425 430 

Gin Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro His 
435 440 445 

Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu Leu 
450 455 460 

Phe Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Ala Ser Arg He 
465 470 475 y 48O 

Glu He Met Ala Lys Gly Thr Val Pro Leu Val Asn Met Met Asn Asn 
485 490 495 

Leu He Gin Asp Thr Asp 
500 



<210> 57 

<211> 498 

<212> PRT 

<213> Pepper 

<400> 57 

Met Asp Thr Leu Leu Arg Thr Pro 
1 5 

Phe Gly Val Lys Val Ser Ala Phe 
20 



Asn Asn Leu Glu Phe Leu His Gly 
10 15 

Ser Ser Val Lys Ser Gin Lys Phe 
25 30 
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Gly Ala Lys Lys' Phe Cys Glu Gly Leu Gly Ser Arg Ser Val Cys Val 
35 40 ' 45 

Lys Ala Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr Lys Lys 
50 55 60 

Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys Gly Val 
65 70 75 " 80 

Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu Ala Val 
85 90 ' 95 

Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Cys Ser lie Asp Pro 
100 105 ~ 110 

Asn Pro Lys Leu lie Trp Pro Asn Asn Tyr Gly Val Trp Val Asp Glu 
115 120 125 

Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp Ser Gly 
130 135 * 140 

Ala Ala Val Tyr lie Asp Asp Lys Thr Thr' Lys Asp Leu Asn Arg Pro 
145 150 155 160 

Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Met Gin Lys 
165 170 175 

Cys lie Leu Asn Gly Val Lys Phe His Gin Ala Lys Val lie Lys Val 
180 185 " 190 

He His Glu Glu Ser Lys Ser Met Leu He Cys Asn Asp Gly He Thr 
195 200 ' - 205 

He Gin Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg Ser Leu 
210 215 " 220 

Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin Val Ala Tyr Gly 
225 230 235 240 

He Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys Met Val 
245 250 255 

Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Val Glu Leu Lys 
260 265 270 

Glu Arg Asn Ser Arg He Pro Thr Phe Leu Tyr Ala Met Pro Phe Ser 
275 280 " 285 

Ser Asn Arg He Phe Leu Glu Glu Thr Ser Leu Val Ala Arg Pro Gly 
290 295 300 

Leu Gly Met Asp Asp He Gin Glu Arg Met Val Ala Arg Leu Ser His 
305 310 315 " 320 

Leu Gly He Lys Val Lys Ser He Glu Glu Asp Glu His Cys Val He 
325 330 335 

Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val Gly He 
340 345 350 

Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met Val Ala 
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355 - 360 365 

Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala lie He Gin Tyr 
370 375 380 

Leu Ser Ser Glu Arg Ser His Ser Gly Asp Glu Leu Ser Ala Ala Val 
385 390 395 40 0 

Trp Lys Asp Leu Trp Pro He Glu Arg Arg Arg Gin Arg Glu Phe Phe 
405 410 415 

Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp Leu Pro Ala Thr Arq- 
420 425 430 

Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arg Tyr Trp His Gly 
4 35 440 44 5 

Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu He Val Phe Gly Leu 
450 455 460 

Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Leu Glu He Met Thr 
465 470 475 480 

Lys Gly Thr Leu Pro Leu Val His Met He Asn Asn Leu Leu Gin Asp 
485 490 495 

Lys Glu 



<210> 58 

<211> 500 

<212> PRT 

<213> Tomato 

<400> 58 

Met Asp Thr Leu Leu Lys Thr Pro Asn Asn Leu Glu Phe Leu Asn Pro 
15 10 15 

His His Gly Phe Ala Val Lys Ala Ser Thr Phe Arg Ser Glu Lys His 
20 25 30 

His Asn Phe Gly Ser Arg Lys Phe Cys Glu Thr Leu Gly Arg Ser Val 
35 40 45 

Cys Val Lys Gly Ser Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr 
50 55 60 

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys 
65 70 75 4 80 

Gly Val Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu 
85 90 95 

Ala Val Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Cys Ser He 
100 105 HO 

Asp Pro Asn Pro Lys Leu He Trp Pro Asn Asn Tyr Gly Val Trp Val 
115 120 125 

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp 
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130 135 140 

Ser Gly Ala- Ala Val Tyr He Asp Asp Asn Thr Ala Lys Asp Leu His" 
. . 145 , 1^0 155 160 

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Met 
165 170 175 

Gin Lys Cys He Met Asn Gly Val Lys Phe His Gin Ala Lys Val He 
180 185 190 

Lys Val He His Glu Glu Ser Lys Ser Met Leu He Cys Asn Asp Glv 
195 200 205 

He Thr He Gin Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Ara 
210 215 220 

Ser Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tyr Gin Val Ala 
225 230 235 240 

Tyr Gly He Leu Ala Glu Val Glu Glu His Pro Phe Asp Val Asn Lys 
245 250 255 

Met Val Phe Met Asp Trp Arg Asp Ser His Leu Lys Asn Asn Thr Asp 
260 265 270 

Leu Lys Glu Arg Asn Ser Arg He Pro Thr Phe Leu Tyr Ala Met Pro 
275 280 285 

Phe Ser Ser Asn Arg He Phe Leu Glu Glu Thr Ser Leu Val Ala Ara 
290 295 300 

Pro Gly Leu Arg He Asp Asp He Gin Glu Arg Met Val Ala Arq Leu 
305 310 315 320 

Asn His Leu Gly He Lys Val Lys Ser He Glu Glu Asp Glu His Cys 
325 330 " 335 

Leu He Pro Met Gly Gly Pro Leu Pro Val Leu Pro Gin Arg Val Val 
340 345 350 

Gly He Gly Gly Thr Ala Gly Met Val His Pro Ser Thr Gly Tyr Met 
355 360 365 

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala He He 
370 375 380 

Gin Tyr Leu Gly Ser Glu Arg Ser His Ser Gly Asn Glu Leu Ser Thr 
385 390 395 400 

Ala Val Trp Lys Asp Leu Trp Pro He Glu Arg Arg Arg Gin Arg Glu 
405 410 ~ 415 

Phe Phe Cys Phe Gly Met Asp He Leu Leu Lys Leu Asp Leu Pro Ala 
420 425 430 

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arq Tyr Trp 
435 440 445 

His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu He Val Phe 
450 455 460 
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Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Ser Arg Phe Glu lie 
465 470 475 480 

Met Thr Lys Gly Thr Val Pro Leu Val Asn Met lie Asn Asn Leu Leu 
485 490 495 

Gin Asp Lys Glu 
500 

<210> 59 
<211> 500 
<212> PRT 
<213> Tobacco 

<400> 59 

Met Asp Thr Leu Leu Lys Thr Pro Asn Lys Leu Glu Phe Leu His Pro 
15 10 15 

Val His Gly Phe Ser Val Lys Ala Ser Ser Phe Asn Ser Val Lys Pro 
20 25 30 

His Lys Phe Gly Ser Arg Lys lie Cys Glu Asn Trp Gly Lys Gly Val 
35 40 45 

Cys Val Lys Ala Lys Ser Ser Ala Leu Leu Glu Leu Val Pro Glu Thr 
50 55 60 

Lys Lys Glu Asn Leu Asp Phe Glu Leu Pro Met Tyr Asp Pro Ser Lys 
65 70 75 " 1 80 

Gly Leu Val Val Asp Leu Ala Val Val Gly Gly Gly Pro Ala Gly Leu 
85 90 ^ * 95 

Ala Val Ala Gin Gin Val Ser Glu Ala Gly Leu Ser Val Val Ser lie 
100 105 110 

Asp Pro Ser Pro Lys Leu lie Trp Pro Asn Asn Tyr Gly Val Trp Val 
115 120 125 

Asp Glu Phe Glu Ala Met Asp Leu Leu Asp Cys Leu Asp Ala Thr Trp 
130 135 140 

Ser Gly Thr Val Val Tyr lie Asp Asp Asn Thr Thr Lys Asp Leu Asp 
145 150 155 * 160 

Arg Pro Tyr Gly Arg Val Asn Arg Lys Gin Leu Lys Ser Lys Met Met 
165 170 175 

Gin Lys Cys lie Leu Asn Gly Val Lys Phe His His Ala Lys Val lie 
180 185 190 

Lys Val lie His Glu Glu Ala Lys Ser Met Leu lie Cys Asn Asp Gly 
195 200 205 

Val Thr lie Gin Ala Thr Val Val Leu Asp Ala Thr Gly Phe Ser Arg 
210 215 220 

Cys Leu Val Gin Tyr Asp Lys Pro Tyr Lys Pro Gly Tyr Gin Val Ala 
225 230 235 240 
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Tyr Gly He Leu Ala Glu Val Glu Glu His Pro Phe Asp Thr Ser Lvs 
245 250 255 

Met Val Leu Met Asp Trp Arg Asp Ser His Leu Gly Asn Asn Met Glu 
260 265 270 

Leu Lys Glu Arg Asn Arg Lys Val Pro Thr Phe Leu Tyr Ala Met Pro 
275 280 285 

Phe Ser Ser Asn Lys He Phe Leu Glu Glu Thr Ser Leu Val Ala Ara 
290 295 300 

Pro Gly Leu Arg Met Asp Asp He Gin Glu Arg Met Val Ala Arq Leu 
305 310 315 320 

Asn His Leu Gly He Lys Val Lys Ser lie Glu Glu Asp Glu His Cvs 
325 330 335 

Val He Pro Met Gly Gly Ser Leu Pro Val lie Pro Gin Arg Val Val 
340 345 350 

Gly Thr Gly Gly Thr Ala Gly Leu Val His Pro Ser Thr Gly Tyr Met 
355 360 365 

Val Ala Arg Thr Leu Ala Ala Ala Pro Val Val Ala Asn Ala He He 
370 375 380 

His Tyr Leu Gly Ser Glu Lys Asp Leu Leu Gly Asn Glu Leu Ser Ala 
385 390 395 400 

Ala Val Trp Lys Asp Leu Trp Pro lie Glu Arg Arg Arg Gin Arg Glu 
. ,.405 410 415 

Phe Phe Cys Phe Gly Met Asp lie Leu Leu Lys Leu Asp Leu Pro Ala 
420 425 430 

Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro Arq Tyr Tro 
435 440 445 

His Gly Phe Leu Ser Ser Arg Leu Tyr Leu Pro Glu Leu lie Phe Phe 
450 455 460 

Gly Leu Ser Leu Phe Ser Arg Ala Ser Asn Thr Ser Arg lie Glu lie 
465 470 475- " 480 

Met Thr Lys Gly Thr Leu Pro Leu Val Asn Met He Asn Asn Leu Leu 
485 490 495 

Gin Asp Thr Glu 

. 500 



<210> 60 
<211> 511 
<212> PRT 

<213> Tagetes erecta 
<400> 60 

Met Asp Thr Phe Leu Arg Thr Tyr Asn Ser Phe Glu Phe Val His Pro 
1 5 10 15 
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Ser Asn Lys Phe Ala Gly Asn Leu Asn Asn Leu Asn Gin Leu Asn Gin 
20 2 5 30 

Ser Lys Ser Gin Phe Gin Asp Phe Arg Phe Gly Pro Lys Lys Ser Gin 
35 40 45 

Phe Lys Leu Gly Gin Lys Tyr Cys Val Lys Ala Ser Ser Ser Ala Leu 
50 ^ 55 60 

Leu Glu Leu Val Pro Glu lie Lys Lys Glu Asn Leu Asp Phe Asp Leu 
65 70 75 80 

Pro Met Tyr Asp Pro Ser Arg Asn Val Val Val Asp Leu Val Val Val 
8 5 90 95 

Gly Gly Gly Pro Ser Gly Leu Ala Val Ala Gin Gin Val Ser Glu Ala 
100 105 110 

Gly Leu Thr Val Cys Ser lie Asp Pro Ser Pro Lys Leu lie Trp Pro 
115 120 125 

Asn Asn Tyr Gly Val Trp Val Asp Glu Phe Glu Ala Met Asp Leu Leu 
130 135 140 

Asp Cys Leu Asp Thr Thr Trp Ser Ser Ala Val Val Tyr lie Asp Glu 
145 * 150 155 160 

Lys Ser Thr Lys Ser Leu Asn Arg Pro Tyr Ala Arg Val Asn Arg Lys 
165 170 175 

Gin Leu Lys Thr Lys Met Leu Gin Lys Cys He Ala Asn Gly Val Lys 
180 185 190 

Phe His Gin Ala Lys Val He Lys Val He His Glu Glu Leu Lys Ser 
195 200 205 

Leu Leu He Cys Asn Asp Gly Val Thr He Gin Ala Thr Leu Val Leu 
210 215 220 

Asp Ala Thr Gly Phe Ser Arg Ser Leu Val Gin Tyr Asp Lys Pro Tyr 
225 230 235 240 

Asn Pro Gly Tyr Gin Val Ala Tyr Gly He Leu Ala Glu Val Glu Glu 
245 250 255 

His Pro Phe Asp Val Asp Lys Met Leu Phe Met Asp Trp Arg Asp Ser 
260 265 270 

His Leu Asp Gin Asn Leu Glu He Lys Ala Arg Asn Ser Arg He Pro 
275 280 285 

Thr Phe Leu Tyr Ala Met Pro Phe Ser Ser Thr Arg He Phe Leu Glu 
290 295 300 

Glu Thr Ser Leu Val Ala Arg Pro Gly Leu Lys Met Glu Asp He Gin 
305 310 " 315 320 

Glu Arg Met Ala Tyr Arg Leu Lys His Leu Gly He Lys Val Lys Ser 
325 330 335 



He Glu Glu Asp Glu Arg Cys Val He Pro Met Gly Gly Pro Leu Pro 
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34 0 34 5 350 

Val Leu Pro Gin Arg Val Leu Gly He Gly Gly Thr Ala Gly Met Val 

355 360 " 365 

His Pro Ser Thr Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro 

370 375 380 

He Val Ala Lys Ser He He Arg Tyr Leu Asn Asn Glu Lys Ser Met 

385 390 395 " 400 

Val Ala Asp Val Thr Gly Asp Asp Leu Ala Ala Gly He Trp Arg Glu 

405 410 415 

Leu Trp Pro He Glu Arg Arg Arg Gin Arg Glu Phe Phe Cys Phe Gly 

420 425 430 

Met Asp He Leu Leu Lys Leu Asp Leu Glu Gly Thr Arg Arg Phe Phe 

4 35 44 0 445 

Asp Ala Phe Phe- Asp Leu Glu Pro Arg Tyr Trp His Gly Phe Leu Ser 

450 455 460 

Ser Arg Leu Phe Leu Pro Glu Leu Val Thr Phe Gly Leu Ser Leu Phe 

465 470 475 480 

Gly His Ala Ser Asn Thr Cys Arg Val Glu He Met Ala Lys Gly Thr 

485 490 ~ 495 

Leu Pro Leu Ala Thr Met He Gly Asn Leu Val Arg Asp Arg Glu 

500 505 510 



<210> 61 
<211> 503 
<212> PRT 
<213> Daffodil 

<400> 61 

Met Asp Thr Leu Leu Arg Thr His Asn Arg Leu Glu Leu Leu Tyr Pro 
1 ' 5 10 15 

Leu His Glu Leu Ala Lys Arg His Phe Leu Ser Pro Ser Pro Asn Pro 
20 25 30 

Gin Asn Pro Asn Phe Lys Phe Phe Ser Arg Lys Pro Tyr Gin Lys Lys 
35 40 " 45 

Cys Arg Asn Gly Tyr He Gly Val Ser Ser Asn Gin Leu Leu Asp Leu 
50 55 60 

Val Pro Glu lie Lys Lys Glu His Leu Glu Phe Asp Leu Pro Leu Tyr 
65 70 75 * 80 

Asp Pro Ser Lys Ala Leu Thr Leu Asp Leu Ala Val Val Gly Gly Gly 
85 90 ~ 95 

Pro Leu Ala Arg Ser Cys Ser Thr Ser Leu Gly Gly Gly Leu Ser Val 
100 ' 105 "* 110 

Val Ser He Asp Pro Asn Pro Lys Leu He Trp Pro Asn Asn Tyr Gly 
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115 120 125 

Val Trp Val Asp Glu Phe Glu Asp Met Asp Leu Leu Asp Cys Leu Asp 
130 135 140 

Ala Thr Trp Ser Gly Ala He Val Tyr Val Asp Asp Arg Ser Thr Lys 

150 155 * ~ 160 

Asn Leu Ser Arg Pro Tyr Ala Arg Val Asn Arg Lys Asn Leu Lys Ser 
165 170 175 

Lys Met Met Lys Lys Cys Val Ser Asn Gly Val ,Arg Phe His Gin Ala 
180 185 190 

Thr Val Val Lys Ala Met His Glu Glu Glu Lys Ser Tyr Leu He Cvs 
195 200 * 205 

Ser Asp Gly Val Thr He Asp Ala Arg Val Val Leu Asp Ala Thr Glv 
210 215 220 

Phe Ser Arg Cys Leu Val Gin Tyr Asp Lys Pro Tyr Asn Pro Gly Tvr 
225 230 235 240 

Gin Val Ala Tyr Gly He Leu Ala Glu Val Glu Glu His Pro Phe Asp 
245 250 255 

Val Asp Lys Met Val Phe Met Asp Trp Arg Asp Ser His Leu Asn Glv 
260 265 270 

Lys Ala Glu Leu Asn Glu Arg Asn Ala Lys lie Pro Thr Phe Leu Tvr 
2 75 280 285 

Ala Met Pro Phe Ser Ser Asn Arg lie Phe Leu Glu Glu Thr Ser Leu 
290 295 300 

Val Ala Arg Pro Gly Leu Lys Met Glu Asp He Gin Glu Arq Met Val 
305 310 315 320 

Ala Arg Leu Asn His Leu Gly lie Arg lie Lys Ser lie Glu Glu Asp 
325 330 335 

Glu Arg Cys Val lie Pro Met Gly Gly Pro Leu Pro Val lie Pro Gin 
340 345 350 

Arg Val Val Gly lie Gly Gly Thr Ala Gly Met Val His Pro Ser Thr 
355 360 365 

Gly Tyr Met Val Ala Arg Thr Leu Ala Ala Ala Pro lie Val Ala Asn 
370 375 380 

Ser lie Val Gin Tyr Leu Val Ser Asp Ser Gly Leu Ser Gly Asn Asp 
385 390 395 400 

Leu Ser Ala Asp Val Trp Lys Asp Leu Trp Pro lie Glu Arg Arg Arq 
405 410 415 

Gin Arg Glu Phe Phe Cys Phe Gly Met Asp lie Leu Leu Lys Leu Asp 
420 425 430 

Leu Glu Gly Thr Arg Arg Phe Phe Asp Ala Phe Phe Asp Leu Glu Pro 
435 440 445 
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Arg Tyr Trp His Gly Phe Leu Ser Ser Arg Leu Phe Leu Pro Glu Leu 
450 455 460 

Val Pro Phe Gly Leu Ser Leu Phe Ser His Ala Ser Asn Thr Cys Lvs 
465 470 475 480 

Leu. Glu lie Met Ala Lys Gly Thr Leu Pro Leu Val Asn Met He Asn 
485 490 495 

Asn Leu Val Gin Asp Arg Asp 
500 
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